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Preface 


BRICK: Well, they say nature hates a vacuum, Big Daddy. 

BIG DADDY: That’s what they say, but sometimes I think 
that a vacuum is a hell of a lot better than some of the stuff 
that nature replaces it with. 

Tennessee Williams (1911-1983) Cat on a Hot Tin Roof 

Quantum field theory is arguably the most far-reaching and beautiful 
physical theory ever constructed. It describes not only the quantum vac¬ 
uum, but also the stuff that nature replaces it with. Aspects of quantum 
held theory are also more stringently tested, as well as verified to greater 
precision, than any other theory in physics. The subject nevertheless has 
a reputation for difficulty which is perhaps well-deserved; its practition¬ 
ers not only manipulate formidable equations but also depict physical 
processes using a strange diagrammatic language consisting of bubbles, 
wiggly lines, vertices, and other geometrical structures, each of which 
has a well defined quantitative significance. Learning this mathematical 
and geometrical language is an important initiation rite for any aspir¬ 
ing theoretical physicist, and a quantum held theory graduate course is 
found in most universities, aided by a large number of weighty quantum 
held theory textbooks. These books are written by professional quan¬ 
tum held theorists and are designed for those who aspire to join them 
in that profession. Consequently they are frequently thorough, serious 
minded and demand a high level of mathematical sophistication. 

The motivation for our book is the idea that quantum held theory is 
too important, too beautiful and too engaging to be restricted to the 
professionals. Experimental physicists, or theoretical physicists in other 
helds, would beneht greatly from knowing some quantum held theory, 
both to understand research papers that use these ideas and also to 
comprehend and appreciate the important insights that quantum held 
theory has to offer. Quantum held theory has given us such a radically 
different and revolutionary view of the physical world that we think 
that more physicists should have the opportunity to engage with it. 
The problem is that the existing texts require far too much in the way 
of advanced mathematical facility and provide too little in the way of 
physical motivation to assist those who want to learn quantum held 
theory but not to be professional quantum held theorists. The gap 
between an undergraduate course on quantum mechanics and a graduate 
level quantum held theory textbook is a wide and deep chasm, and one 
of the aims of this book is to provide a bridge to cross it. That being 
said, we are not assuming the readers of this are simple-minded folk who 
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1 After all, with the number of chapters 
we ended up including, we could have 
called it ‘Fifty shades of quantum field 
theory’. 


2 T.S. Eliot on Ulysses. 


can be fobbed off with a trite analogy as a substitute for mathematical 
argument. We aim to introduce all the maths but, by using numerous 
worked examples and carefully worded motivations, to smooth the path 
for understanding in a manner we have not found in the existing books. 

We have chosen this book’s title with great care. 1 Our imagined reader 
is an amateur, wanting to learn quantum field theory without (at least 
initially) joining the ranks of professional quantum field theorists; but 
(s)he is gifted, possessing a curious and adaptable mind and willing to 
embark on a significant intellectual challenge; (s)he has abundant cu¬ 
riosity about the physical world, a basic grounding in undergraduate 
physics, and a desire to be told an entertaining and intellectually stim¬ 
ulating story, but will not feel patronized if a few mathematical niceties 
are spelled out in detail. In fact, we suspect and hope that our book will 
find wide readership amongst the graduate trainee quantum field theo¬ 
rists who will want to use it in conjunction with one of the traditional 
texts (for learning most hard subjects, one usually needs at least two 
books in order to get a more rounded picture). 

One feature of our book is the large number of worked examples, which 
are set in slightly smaller type. They are integral to the story, and flesh 
out the details of calculations, but for the more casual reader the guts 
of the argument of each chapter is played out in the main text. To 
really get to grips with the subject, the many examples should provide 
transparent demonstrations of the key ideas and understanding can be 
confirmed by tackling the exercises at the end of each chapter. The 
chapters are reasonably short, so that the development of ideas is kept 
at a steady pace and each chapter ends with a summary of the key ideas 
introduced. 

Though the vacuum plays a big part in the story of quantum field the¬ 
ory, we have not been writing in one. In many ways the present volume 
represents a compilation of some of the best ideas from the literature 
and, as a result, we are indebted to these other books for providing 
the raw material for many of our arguments. There is an extensive list 
of further reading in Appendix A where we acknowledge our sources, 
but we note here, in particular, the books by Zee and by Peskin and 
Schroeder and their legendary antecedent: the lectures in quantum field 
theory by Sidney Coleman. The latter are currently available online as 
streamed videos and come highly recommended. Also deserving of spe¬ 
cial mention is the text by Weinberg which is ‘a book to which we are 
all indebted, and from which none of us can escape.’ 2 

It is a pleasure to acknowledge the help we have received from var¬ 
ious sources in writing this book. Particular mention is due to Sonke 
Adlung at Oxford University Press who has helped steer this project to 
completion. No authors could wish for a more supportive editor and 
we thank him, Jessica White and the OUP team, particularly Mike Nu¬ 
gent, our eagle-eyed copy editor. We are very grateful for the comments 
and corrections we received from a number of friends and colleagues who 
kindly gave up their time to read drafts of various chapters: Peter Byrne, 
Claudio Castelnovo, John Chalker, Martin Galpin, Chris Maxwell, Tom 
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McLeish, Johannes Moller, Paul Tulip and Rob Williams. They deserve 
much credit for saving us from various embarrassing errors, but any that 
remain are due to us; those that we find post-publication will be posted 
on the book’s website: 

http://www.dur.ac.uk/physics/qftgabook 

For various bits of helpful information, we thank Hideo Aoki, Nikitas Gi- 
dopoulos, Paul Goddard and John Singleton. Our thanks are also due to 
various graduate students at Durham and Oxford who have unwittingly 
served as guinea pigs as we tried out various ways of presenting this 
material in graduate lectures. Finally we thank Cally and Katherine for 
their love and support. 


TL & SJB 
Durham & Oxford 
January 2, 2014 
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Overture 



To begin at the beginning 
Dylan Thomas (1914-1953) 

Beginnings are always troublesome 
George Eliot (1819-1880) 


0.1 What is quantum field theory? 


0.1 

What is quantum field 
ory? 

the- 

1 

0.2 

What is a field? 

2 

0.3 

Who is this book for? 

2 

0.4 

Special relativity 

3 

0.5 

Fourier transforms 

6 

0.6 

Electromagnetism 

7 


Every particle and every wave in the Universe is simply an excitation 
of a quantum field that is defined over all space and time. 

That remarkable assertion is at the heart of quantum field theory. It 
means that any attempt to understand the fundamental physical laws 
governing elementary particles has to first grapple with the fundamentals 
of quantum field theory. It also means that any description of compli¬ 
cated interacting systems, such as are encountered in the many-body 
problem and in condensed matter physics, will involve quantum field 
theory to properly describe the interactions. It may even mean, though 
at the time of writing no-one knows if this is true, that a full theory 
of quantum gravity will be some kind of quantum upgrade of general 
relativity (which is a classical field theory). In any case, quantum field 
theory is the best theory currently available to describe the world around 
us and, in a particular incarnation known as quantum electrodynamics 
(QED), is the most accurately tested physical theory. For example, the 
magnetic dipole moment of the electron has been tested to ten significant 
figures. 

The ideas making up quantum field theory have profound conse¬ 
quences. They explain why all electrons are identical (the same ar¬ 
gument works for all photons, all quarks, etc.) because each electron is 
an excitation of the same electron quantum field and therefore it’s not 
surprising that they all have the same properties. Quantum field theory 
also constrains the symmetry of the representations of the permutation 
symmetry group of any class of identical particles so that some classes 
obey Fermi-Dirac statistics and others Bose-Einstein statistics. Inter¬ 
actions in quantum field theory involve products of operators which are 
found to create and annihilate particles and so interactions correspond 
to processes in which particles are created or annihilated; hence there 
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Fig. 1 A field is some kind of ma¬ 
chine that takes a position in space- 
time, given by the coordinates a^, and 
outputs an object representing the am¬ 
plitude of something at that point in 
spacetime. Here the output is the 
scalar (f)(x but it could be, for ex¬ 
ample, a vector, a complex number, a 
spinor or a tensor. 


is also the possibility of creating and destroying virtual particles which 
mediate forces. 


0.2 What is a field? 

This is all very well, but what is a field? We will think of a field as some 
kind of machine that takes a position in spacetime and outputs an ob¬ 
ject representing the amplitude of something at that point in spacetime 
(Fig. 1). The amplitude could be a scalar, a vector, a complex number, 
a spinor or a tensor. This concept of a field, an unseen entity which 
pervades space and time, can be traced back to the study of gravity due 
to Kepler and ultimately Newton, though neither used the term and 
the idea of action-at-a-distance between two gravitationally attracting 
bodies seemed successful but nevertheless utterly mysterious. Euler’s 
fluid dynamics got closer to the matter by considering what we would 
now think of as a velocity field which modelled the movement of fluid at 
every point in space and hence its capacity to do work on a test parti¬ 
cle imagined at some particular location. Faraday, despite (or perhaps 
because of) an absence of mathematical schooling, grasped intuitively 
the idea of an electric or magnetic field that permeates all space and 
time, and although he first considered this a convenient mental picture 
he began to become increasingly convinced that his lines of force had an 
independent physical existence. Maxwell codified Faraday’s idea and the 
electromagnetic field, together with all the paraphernalia of field theory, 
was born. 

Thus in classical physics we understand that gravity is a field, elec¬ 
tromagnetism is a field, and each can be described by a set of equations 
which governs their behaviour. The field can oscillate in space and time 
and thus wave-like excitations of the field can be found (electromagnetic 
waves are well-known, but gravity waves are still to be observed). The 
advent of quantum mechanics removed the distinction between what had 
been thought of as wave-like objects and particle-like objects. Therefore 
even matter itself is an excitation of a quantum field and quantum fields 
become the fundamental objects which describe reality. 


0.3 Who is this book for? 

Quantum field theory is undoubtedly important, but it is also noto¬ 
riously difficult. Forbidding-looking integrals and a plethora of funny 
squiggly Feynman diagrams are enough to strike fear in many a heart 
and stomach. The situation is not helped by the fact that the many ex¬ 
cellent existing books are written by exceedingly clever practioners who 
structure their explanations with the aspiring professional in mind. This 
book is designed to be different. It is written by experimental physicists 
and aimed at the interested amateur. Quantum field theory is too in¬ 
teresting and too important to be reserved for professional theorists. 
However, though our imagined reader is not necessarily an aspiring pro- 



0.4 


Special relativity 3 


fessional (though we hope quite a few will be) we will assume that (s)he 
is enthusiastic and has some familiarity with non-relativistic quantum 
mechanics, special relativity and Fourier transforms at an undergradu¬ 
ate physics level. In the remainder of this chapter we will review a few 
basic concepts that will serve to establish some conventions of notation. 


0.4 Special relativity 

Quantum fields are defined over space and time and so we need a proper 
description of spacetime, and so we will need to use Einstein’s special 
theory of relativity which asserts that the speed c of light is the same 
in every inertial frame. This theory implies that the coordinates of an 
event in a frame S and a frame S (moving relative to frame S at speed 
v along the x-axis) are related by the Lorentz transformation 


t 

II 

vx 

~ 

X 

"a 

II 

- vt), 

y 

= y , 



z = z, (1) 

where 7 = (1 — /3 2 ) -1 / 2 and (3 = v/c. Because the speed of light sets 
the scale for all speeds, we will choose units such that c = 1. For similar 
reasons 1 we will also set h = 1 . 

A good physical theory is said to be covariant if it transforms sen¬ 
sibly under coordinate transformations . 2 In particular, we require that 
quantities should be Lorentz covariant if they are to transform ap¬ 
propriately under the elements of the Lorentz group (which include the 
Lorentz transformations of special relativity, such as eqn 1). This will 
require us to write our theory in terms of certain well-defined mathe¬ 
matical objects, such as scalars, vectors and tensors . 3 

• Scalars: A scalar is a number, and will take the same value in 
every inertial frame. It is thus said to be Lorentz invariant. 
Examples of scalars include the electric charge and rest mass of a 
particle. 

• Vectors: A vector can be thought of as an arrow. In a particular 
basis it can be described by a set of components. If the basis 
is rotated, then the components will change, but the length of 
the arrow will be unchanged (the length of a vector is a scalar). 
In spacetime, vectors have four components and are called four- 
vectors. A four-vector is an object which has a single time-like 
component and three space-like components. Three-vectors will be 
displayed in bold italics, such as x or p for position and momentum 
respectively. The components of three-vectors are listed with a 
Roman index taken from the middle of the alphabet: e.g. x l , with 
i = 1,2,3. Four-vectors are made from a time-like part and a 
space-like part and are displayed in italic script, so position in 


1 In some of the early chapters we will 
include the factors of h and c so that 
the reader can make better contact with 
what they already know, and will give 
notice when we are going to remove 
them. 

2 A good counterexample is the two- 
component ‘shopping vector’ that con¬ 
tains the price of fish and the price of 
bread in each component. If you ap¬ 
proach the supermarket checkout with 
the trolley at 45° to the vertical, you 
will soon discover that the prices of 
your shopping will not transform ap¬ 
propriately. 

3 

We will postpone discussion of tensors 
until the end of this section. 
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4 These are written with c = 1. 


° Though strictly dp refers only to the 
^th component of the four-vector oper¬ 
ator <9, rather than to the whole thing, 
we will sometimes write a subscript (or 
superscript) in expressions like this to 
indicate whether coordinates are listed 
with the indices lowered or with them 
raised. 


^In full, this equation would be 



These unfortunate terms are due 
to the English mathematician J. J. 
Sylvester (1814-1897). Both types of 
vectors transform covariantly, in the 
sense of ‘properly’, and we wish to re¬ 
tain this sense of the word ‘covariant’ 
rather than using it to simply label one 
type of object that transforms properly. 
Thus we will usually specify whether 
the indices on a particular object are 
‘upstairs’ (like a or ‘downstairs’ (like 
dp4>) and their transformation proper¬ 
ties can then be deduced accordingly. 


spacetime is written x where x = (t,x). Components for four- 
vectors will be given a Greek index, so for example x M where fi = 
0,1,2,3. We say that the zeroth component, x°, is time-like. 


Example 0.1 

Some other examples of four-vectors 4 are: 

• the energy-momentum four-vector p = (E,p), 

• the current density four-vector j = ( p,j ), 

• the vector potential four-vector A = (V, A). 

The four-dimensional derivative operator dp is also a combination of a time-like part 
and a space-like part, and is defined 5 by 


_d_ _ Sd_ \ _ (d_ d_ d_ d _\ 

dxv \ dt ’ / \ dt ’ dx ’ dy ’ dz ) 


( 2 ) 


Note the lower index written in dp, contrasting with the upper index on four-vectors 
like which means that the four-dimensional derivative is ‘naturally lowered’. This 
is significant, as we will now describe. 


A general coordinate transformation from one inertial frame to another 
maps {x^} —> {x^}, and the vector a M transforms as 



(3) 


Here we have used the Einstein summation convention, by which 
twice repeated indices are assumed to be summed. 6 Certain other 
vectors will transform differently. For example, the gradient vector 
= d(j>/dx M transforms as 

= ( <9aT\ dj>_ 
dx 11 \ dx^ J dx v ' 


The jargon is that a M transforms like a contravariant vector and 
dcj)/dx M = d^cf) transforms like a covariant vector, 7 though we will 
avoid these terms and just note that has its indices ‘upstairs’ and 
d^cj) has them ‘downstairs’ and they will then transform accordingly. 

The Lorentz transformation (eqn 1) can be rewritten in matrix form 


( 


/ 

7 

-Pi 

0 

0 \ 

( * \ 

X 



-/?7 

7 

0 

0 

X 

y 



0 

0 

1 

0 

y 

\z) 


V 

0 

0 

0 

1 / 

V * / 


or for short as 

x^ = K^x u , ( 6 ) 

where A ,J „ = (dx 11 /dx v ) is the Lorentz transformation matrix. In the 
same way, the energy-momentum four-vector transforms as 


p» = A * vV v . 


(7) 
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A downstairs vector a M transforms as 

(T/i = ttjy, (8) 

where A^" = (dx^/dx 11 ) is the inverse of the Lorentz transformation 
matrix A M „. 8 

The Lorentz transformation changes components but leaves the length 
of the four-vector x unchanged. This length is given by the square root 9 
of 


\x\ 2 =x-x= (x 0 ) 2 - (x 1 ) 2 - (x 2 ) 2 - (x 3 ) 2 . 

In general, the four-vector inner product 10 is 

a ■ b = a°b° — a ■ b, 

which we can write 

a-b = g^ a^b u , 

where the metric tensor g is given by 

/ 1 0 0 0 \ 

0-10 0 
0 0-10 
V o o o —i y 


9p.v = 


( 10 ) 

( 11 ) 

( 12 ) 

(13) 


Upstairs and downstairs vectors are related by the metric tensor via 

Qfi 9a v O' 5 ( 1 ^ 1 ) 

so that we can lower or raise an index by inserting the metric tensor. 
The form of the metric tensor in eqn 13 allows us to write 

a° = a 0 a 1 — — a*, (15) 


and hence 


a-b = a^b v = a^ = a p b^. (16) 

Note also that a - b = ga^b^ and = g^. 


Example 0.2 


(i) 


(ii) 

(iii) 


An example of an inner product is 

PP = PuP^ = ( E,p ) • (E,p) = E 2 - p 2 = m 2 , (17) 

where m is the rest mass of the particle. 

The combination d^x v = = 5" and hence the inner product = 4 

(remember the Einstein summation convention). 

The d’Alembertian operator d 2 is given by a product of two derivative op¬ 
erators (and is the four-dimensional generalization of the Laplacian operator). 
It is written as 


a 2 = a^df. 


a 2 a 2 a 2 a 2 

dt- 2 dx 2 dy 2 dz 2 



(18) 


8 Note that in the equation x !t = x v 
we have 


/ 

7 

01 

0 

0 ^ 


01 

1 

0 

0 


0 

0 

1 

0 

V 

0 

0 

0 

1 ) 


In fact 


dx p \ 
dx v ) 



= St, 


where the Kronecker delta 5? is de¬ 
fined by 


Si = 


1 i — j 
0 i^j. 


The Sj symbol is named after Leopold 
Kronecker (1823-1891). 


^ Of course |ic| can be negative in special 
relativity, so it is better to deal with 
\x \ 2 , the square of the length. 

1 Note that other conventions are pos¬ 
sible and some books write a ■ b = 
-a°b° + a ■ b and define their metric 
tensor differently. This is an entirely le¬ 
gitimate alternative lifestyle choice, but 
it’s best to stick to one convention in a 
single book. 


Exercise: You can check that 


g^g vp = ^ 

and also that 

A u — n \ K n pu 
7 V — £//*« p 9 


Named in honour of the French math¬ 
ematician Jean le Rond d’Alembert 
(1717-1783). In some texts the 
d’Alembertian is written as n and in 
some as D 2 . Because of this confusion, 
we will avoid the □ symbol altogether. 


(19) 



6 Overture 


To complete this discussion, we can also define a general tensor T£;.„ 
with an arbitrary set of upstairs and downstairs indices. This transforms 

as 

dx 1 dx k dx l dx n t 

( 20 ) 


n. 


■k' 


dx i dx k dx e ' dx r 


rTl 


It has two indices (hence second 
rank) and one is upstairs, one is down¬ 
stairs (hence mixed). 


12 

This is named after Italian physi¬ 
cist Tullio Levi-Civita (1873-1941). 
Useful relationships with the Levi- 
Civita symbol include results for three- 
dimensional vectors: 

(b x c y = e ijk Vc k , 

a ■ (6 x c) = e ijk aV c k , 
and matrix algebra: 

det A — £i-±i 2 ■ --in -^ 2*2 ' ' ' i 


Example 0.3 

(i) The Kronecker delta 5*j is a ‘mixed tensor of second rank’, 11 and one can check 
that it transforms correctly as follows: 


-j _ dx 1 dx £ -i _ dx 1 dx k _ ^ 
* dx k dxi k Q x k dxi i 


( 21 ) 


Note that whenever 8%j or S lJ are written, they are not tensors and are simply a 
shorthand for the scalar 1, in the case when i = j, or 0 when i ^ j. 

(ii) The antisymmetric symbol or Levi-Civita symbol 12 e x i k ^ is defined in four 
dimensions by (i) all even permutations ijkt of 0123 (such as ijki = 2301) have 
£ zjM _ -j_. (ji) all permutations ijkt of 0123 (such as ijkt = 0213) have e % i k * = 
— 1; (iii) all other terms are zero (e.g. £ 0012 = 0). The Levi-Civita symbol can be 
defined in other dimensions. 13 We will not treat this symbol as a tensor, so the 
version with downstairs indices £ijk is identical to e i i k . 


0.5 Fourier transforms 


where A is a nx n matrix with compo¬ 
nents Aij. 


13 

The version in two dimensions is 
rather simple: 

e 01 = —£ 10 = 1, 

£ °° = e 11 = 0. 

In three dimensions, the nonzero com¬ 
ponents are: 

£ 012 = £ 201 = £ 120 = lf 

£ 021 = £ 210 =£ 102 = „ 1 


Jean Baptiste Joseph Fourier (1768— 
1830) 


We will constantly be needing to swap between representations of an 
object in spacetime and in the corresponding frequency variables, that 
is spatial and temporal frequency. Spatial frequency k and temporal 
frequency u> also form a four-vector ( co,k ) and using E = tux and p = 
hk we see that this is the energy-momentum four-vector ( E,p ). (In 
fact, with our convention h = 1 the two objects are identical!) To 
swap between representations, we define the four-dimensional Fourier 
transform f(k) of a function f(x) of spacetime x as 

f(k) = J d 4 xe ik ' x f(x), (22) 

where four-dimensional integration is defined by 

J d 4 ai = J da ,0 dx 1 da; 2 da; 3 . (23) 

The inverse transform is 

/(*) = / ^e- ifc '*/(fc), (24) 

and contains four factors of 2tt that are needed for each of the four 
integrations. Another way of writing eqn 22 is 


^ Getting these right is actually impor¬ 
tant: if you have (27 t) 4 on the top of an 
equation and not the bottom, your an¬ 
swer will be out by a factor of well over 
two million. 


f(u,k) = J d 3 xdte i(uJt - k - x) f(t,x). (25) 

In the spirit of this definition, we will try to formulate our equations so 
that every factor of dk comes with a (27 t), hopefully eliminating one of 
the major causes of insanity in the subject, the annoying factors 14 of 27 t. 
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Example 0.4 

The Dirac delta function 5(rr) is a function localized at the origin and which 
has integral unity. It is the perfect model of a localized particle. The integral of a 
d-dimensional Dirac delta function 5^ d \x) is given by 




= 1 . 


It is defined by 


f d d xf(x)5^(x) = f(0). 
Consequently, its Fourier transform is given by 

<5 (d) (fc) = f d d xe ikx 5 w (x) = 1. 
Hence, the inverse Fourier transform in four-dimensions is 


/ 


d 4 /c 


= 5^\x). 


(26) 

(27) 

(28) 

(29) 


0.6 Electromagnetism 


In SI units Maxwell’s equations in free space can be written: 


James Clerk Maxwell (1831-1879) 


V •#=-£-, VxE = -f, 

eo ’ ot ’ 

v b = o, Vxn = A i 0 j + i^. 


(30) 


In this book we will choose 15 the Heaviside Lorentz 16 system of units 
(also known as the ‘rationalized Gaussian CGS’ system) which can be 
obtained from SI by setting eo = Ho = 1. Thus the electrostatic potential 
V(yX) = g/47reo|cc| of SI becomes 


V(x) 


g 

47r|cc| ’ 


(31) 


in Heaviside-Lorentz units, and Maxwell’s equations can be written 


V-E = p, V x E = 

V B = 0, VxB = |(J+f). 


(32) 


Using our other choice of c = h = 1 obviously removes the factors of 
c from these equations. In addition, the fine structure constant a = 
e 2 /47 xhc ~ simplifies to 


^ Although SI units are preferable for 
many applications in physics, the desire 
to make our (admittedly often compli¬ 
cated) equations as simple as possible 
motivates a different choice of units for 
the discussion of electromagnetism in 
quantum field theory. Almost all books 
on quantum field theory use Heaviside- 
Lorentz units, though the famous text¬ 
books on electrodynamics by Landau 
and Lifshitz and by Jackson do not. 

^ These units are named after the En¬ 
glish electrical engineer O. Heaviside 
(1850-1925) and the Dutch physicist 
H. A. Lorentz (1853—1928). 


a = 



(33) 


Note that we will give electromagnetic charge q in units of the electron 
charge e by writing q = Q |e|. The charge on the electron corresponds 
to Q = —1. 




Part I 


The Universe as a set of 
harmonic oscillators 


In this introductory part of the book, we trace the development of the 
picture of the Universe which underpins quantum field theory. This 
picture is one in which harmonic oscillators are ubiquitous, and such 
oscillators form a central paradigm of quantum systems. We show that 
non-interacting harmonic oscillators have eigenfunctions which look and 
behave like particles and find some elegant methods for describing sys¬ 
tems of coupled oscillators. 

• In Chapter 1 we provide a formulation of classical mechanics which 
is suitable for a quantum upgrade. This allows us to talk about 
functionals and Lagrangians. 

• The simple harmonic oscillator, presented in Chapter 2, is well- 
known from basic quantum physics as an elementary model of an 
oscillating system. We solve this simple model using creation and 
annihilation operators and show that the solutions have the charac¬ 
teristics of particles. Linking masses by springs into a chain allows 
us to generalize this problem and the solutions are phonons. 

• The next step is to change our viewpoint and get rid of wave func¬ 
tions entirely and develop the occupation number representation 
which we do in Chapter 3. We show that bosons are described by 
commuting operators and fermions are described by anticommut¬ 
ing operators. 

• Already we have many of the building blocks of a useful theory. 
In Chapter 4 we consider how to build single-particle operators 
out of creation and annihilation operators, and this already gives 
us enough information to discuss the tight-binding model of solid 
state physics and the Hubbard model. 





Lagrangians 


1.1 Fermat’s principle 10 

1.2 Newton’s laws 10 

1.3 Functionals 11 

1.4 Lagrangians and least action 

14 

1.5 Why does it work? 16 

Chapter summary 1? 

Exercises 1? 



Fig. 1.1 Refraction of a light ray 
through a slab of glass. The ray finds 
the path of least travel time from A to 
B. 



Fig. 1.2 A particle moves from A to B 
in time r and its path is described by 
Newton’s laws of motion. 


In this chapter, we give an introduction to Lagrangians in classical me¬ 
chanics and explain why the Lagrangian formulation is a sensible way 
to describe quantum fields. 


1.1 Fermat’s principle 

We begin with an example from the study of optics. Consider the passage 
of a light ray through a slab of glass as shown in Fig. 1.1. The bending 
of the light ray near the air/glass interface can be calculated using the 
known refractive indices of air and glass using the famous Snell’s law of 
refraction, named after Willebrord Snellius, who described it in 1621, 
but discovered first by the Arabian mathematician Ibn Sahl in 984. In 
1662, Pierre de Fermat produced an ingenious method of deriving Snell’s 
law on the basis of his principle of least time. This states that the 
path taken between two points A and B by a ray of light is the path 
that can be traversed by the light in the least time. Because light travels 
more slowly in glass than in air, the ray of light crosses the glass at a 
steeper angle so it doesn’t have so much path length in the glass. If the 
light ray were to take a straight line path from A to B this would take 
longer. This was all very elegant, but it didn’t really explain why light 
would choose to do this; why should light take the path which took the 
least time? Why is light in such a hurry? 

Fermat’s principle of least time is cute, and seems like it is telling us 
something, but at first sight it looks unhelpful. It attempts to replace 
a simple formula (Snell’s law), into which you can plug numbers and 
calculate trajectories, with a principle characterizing the solution of the 
problem but for which you need the whole apparatus of the calculus of 
variations to solve any problems. Fermat’s principle is however the key 
to understanding quantum fields, as we shall see. 


1.2 Newton’s laws 

A somewhat similar problem is found in the study of dynamics. Consider 
a particle of mass in subject to a force F which moves in one spatial 
dimension x from point A to B, as shown in the spacetime diagram in 
Fig. 1.2. Here time is on the horizontal axis and space is on the vertical 
axis. The exact path x(t) that the particle takes is given by Newton’s 
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laws of motion, i.e. 

F = mi. (1.1) 

This equation can be integrated to find x(t). However, when you stop 
and think about it, this is a very quantum-unfriendly approach. The 
solution gives you the position of the particle at every time t from t = 0 
to t = t. Quantum mechanics tells us that you might measure the 
particle’s position at t = 0 and find it at A, and you might measure it 
again at t = t and find it at B, but you’d not be able to know precisely 
what it did in between. Having a method which lets you calculate x(t) 
from a differential equation is not a good starting point. 

Dynamics need to be formulated in a completely different way if the 
subject is to be generalized to quantum mechanics. This is exactly what 
Joseplr-Louis Lagrange and William Rowan Hamilton did, although they 
had no idea that what they were doing would make dynamics more 
quantum-friendly. We will take a slightly different approach to theirs 
and arrive at the final answer by asking ourselves how kinetic energy T 
and potential energy V vary during the trajectory of the particle. We 
know that they must sum to the total energy E = T+V which must be a 
constant of the motion. But during the trajectory, the balance between 
kinetic and potential energy might change. 

It is simple enough to write down the average kinetic energy T during 
the trajectory, which is given by 

T=-[ \m[x(t)f At. (1.2) 

The average potential energy V during the trajectory is given by 

V=- f V[x(t)]dt. (1.3) 

T Jo 

These two quantities must sum to give the total energy E = E = T + V. 
However, what we want to do is to consider how T and V vary as you 
alter the trajectory. To do this, we need a little bit of mathematics, 
which we cover in the next section. 


1.3 Functionals 

The expressions in eqns 1.2 and 1.3 are functionals of the trajectory x(t). 
What does this mean? 

Let us recall that a function is a machine (see Fig. 1.3) which turns a 
number into another number. For example, the function f{x) = 3ar will 
turn the number 1 into the number 3, or the number 3 into the number 
27. Give a function a number and it will return another number. 

A functional is a machine which turns a function into a number. You 
feed the machine with an entire function, like f(x) = x 2 or f(x) = sin a: 
and it returns a number. 


J.-L. Lagrange (1736-1813) was an 
Italian-born French mathematician and 
physicist. 

W. R. Hamilton (1805-1865) was an 
Irish mathematician and physicist. 


function 



Fig. 1.3 A function turns a number 
into another number. A functional op¬ 
erates on an entire function and pro¬ 
duces a number. 



Example 1.1 

Here are some examples of functionals. 

• The functional F[f] operates on the function / as follows: 

F\f]= f 1 f(x) dx. (1.4) 

Jo 

Hence, given the function f(x) = x 2 , the functional returns the number 

F[f] = f 1 x 2 dx = \. (1.5) 

Jo o 

• The functional G[f] operates on the function / as follows: 

G[f]= r 5{f(x)] 2 dx. (1.6) 

J —a 

Hence, given the function f(x) = x 2 , the functional returns the number 

G[f]= f 5x 4 dx = 2a 5 . (1.7) 

J —a 

• A function can be thought of as a trivial functional. For example, the func¬ 
tional F x [f] given by 

/ oo 

f(y)S(y - x)dy = f(x), (1.8) 

-oo 

returns the value of the function evaluated at x. 


We now want to see how a functional changes as you adjust the function 
which is fed into it. The key concept here is functional differentiation. 
Recall that a derivative of a function is defined as follows: 


d/ = Hm f{x + e) - f{x) 
dx €—*o e 


(1.9) 


The derivative of the function tells you how the number returned by 
the function /( x) changes as you slightly change the number x that you 
feed into the ‘machine’. In the same way, we can define a functional 
derivative of a functional F[f] as follows: 


6F - liiTi + e5 ^ x ~ x ~ nim 

6 fix) e^o e [ ’ 

The functional derivative tells you how the number returned by the 
functional F[f{x)} changes as you slightly change the function fix) that 
you feed into the machine. 


Example 1.2 

Here are some examples of calculations of functional derivatives. You can work 
through these if you want to acquire the skill of functional differentiation, or skip to 
the next bit of text if you are happy to take the results on trust. 
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The functional /[/] = J) 1 f fix) da; has a functional derivative given by 


sm 

Sf(xo) 


lim — 
e —>0 £ 


J [f{x) + e5(x-xo)]dx-j f(x)dx 
j 6{x - xo) 


— rro) d# 

-1 

1 -1 < x 0 < 1 

0 otherwise. 


(i.ii) 


The functional J[f] = f [f (y)] p d y has a functional derivative with respect 
to f(x) given by 


"W = lim — 
e 


Sf(x) 


J[f{y) + e6(y - x)] p <t>(y) d y - J [f (y)\ p <f>(y) d y 


= P[f( x )] p 1 <K X )- 


( 1 . 12 ) 


The functional H[f] = f a g[f(x)] dx, where g is a function whose derivative is 
g' = dg/dx, has a functional derivative given by 

SH[f] 


Sf(xo) 


lim — 

e—>0 € 

lim — 

e —^0 e 


J g[f(x) + eS(x - so)] da; - J g[f(x)] da; 

J (s[/(®)] + e5(x - so)s'[/(s)]) da; - J g[f{x)] da; 
J S(x- x 0 ) g'[f(x)]dx 


= g’[f( x o)]. 


(1.13) 


Using the result of the previous example, the functional V[x] = ^ Jq V[x(t)] dt 
so that 

-^wi. “ ,4 > 

The functional J[f] is defined by J[f] = f g{f) dy where f = df /dy. Hence 

'Of' 


SJ W = lim 1 


J dyg(J^[f(.y) + e8(.V-x)]j - J dy g 


dyJi 


(1.15) 


5f(x) e 

and using 

g (J^[f(y) + eS(y - = 9 (/' + eS'(y-x)) m g(f') + eS'(y — x) ~^TT~ > 

(1.16) 

then the calculation can be reduced to an integral by parts 


SJ[f] 
<5 f{x) 




5{y - x) 


df \ J da/ v df 

(1-17) 

The term in square brackets vanishes assuming x is inside the limits of the 
integral, and we have simply 

SJ[f = _d_ ( dg(f') ' 

Sf{x) da; V d/' 


(1.18) 


An example of the previous result is that for F[<j>] = f ( J ~ dy, 

5<j)(x) dx 2 

Another example is for T = ^ /q ^ m[x(t )] 2 dt, 

5T[x] mx 

8x(t) r 


(1.19) 


( 1 . 20 ) 


Equation 1.19 can be easily generalized 
to three dimensions and leads to a very 
useful result which is worth memoriz¬ 
ing, namely that if 


then 


I = 


j (V <« 2 


SI 

S(j) 


= —2 V 2 <t>. 
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1.4 Lagrangians and least action 


1 They might both decrease by the same 
amount if the classical trajectory max¬ 
imizes (rather than minimizes) the ac¬ 
tion, see below, but this case is not the 
one usually encountered. 


With these mathematical results under our belt, we are now ready to 
return to the main narrative. How does the average kinetic energy and 
the average potential energy vary as we adjust the particle trajectory? 
We now have the main results from eqns 1.14 and 1.20 which are: 


8V[x\ V'[x(t)\ 

5x(t) t 


ST[x] mx 

Sx(t) T 


( 1 . 21 ) 


In fact, Newton’s laws tell us that for the classical trajectory of the 
particle we have that mx = —dV/dx. This means that for variations 
about the classical trajectory 


SV[x] _ ST[x) 
5x{t) 8x(t) ’ 


This means that if you slightly deviate from the classical trajectory, 
then both the average kinetic energy and the average potential energy 
will increase 1 and by the same amount. This equation can be rewritten 
as 

Aym - v[x]) = 0, (1.23) 

i.e. that the difference between the average kinetic energy and the av¬ 
erage potential energy is stationary about the classical trajectory. This 
shows that there is something rather special about the difference be¬ 
tween kinetic energy and potential energy, and motivates us to define a 
quantity known as the Lagrangian L as 


L = T — V. 


(1.24) 


The integral of the Lagrangian over time is known as the action S 


2 

To be pedantic, the principle only 
shows the action is stationary. It could 
be a maximum, or a saddle point, just 
as easily as a minimum. ‘Stationary 
action’ would be better than ‘least ac¬ 
tion’. But we are stuck with the name. 



Fig. 1.4 Small adjustments to the path 
of a particle from its classical trajectory 
lead to an increase in the action S. 


S= [ Ldt, (1.25) 

Jo 

and so the action has dimensions of energy x time, and hence is measured 
in Joule-seconds. This is the same units as Planck’s constant h, and we 
will see later in this chapter why it is often appropriate to think of 
measuring S in units of Planck’s constant. Our variational principle 
(eqn 1.23) connecting variations of average kinetic energy and average 
potential energy can now be rewritten in a rather appealing way since 
for this problem S = (T — V) dt = t(T[x] — P[a;]), so that 


as 

5x(t ) 


= 0 , 


(1.26) 


and this is known as Hamilton’s principle of least action. 2 It states 
that the classical trajectory taken by a particle is such that the action is 
stationary; small adjustments to the path taken by the particle (Fig. 1.4) 
only increase the action (in the same way that small adjustments to the 
path taken by a ray of light from the one determined by Snell’s law 
lengthen the time taken by the light ray). 
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Example 1.3 


The Lagrangian L can be written as a function of both position and velocity. Quite 
generally, one can think of it as depending on a generalized position coordinate x(t) 
and its derivative x(t ), called the velocity. Then the variation of S with x{t) is 
5S/8x(t) and can be written as 


SS 

8x(t) 



8L 

8x(t) 

SL 

Sx(t) 


SL Sx(u) SL Sx(u) 


l Sx(u) Sx{t) Sx(u) Sx(t) J 

SL e/ N SL d p/ 

5 ( u - *) + 7 ~rj( u ~ *) 


\_Sx(u) 

S(u — t) 


+ 


SL 


Sx(u) d t 
1*/ 


Sx(u)\ i 


— J duS(u — t) 


d SL 
d t Sx(u) 


d SL 
d t Sx(t) ’ 


and hence the principle of least action (eqn 1.26) yields 


(1.27) 


SL d SL 

Sx(t) d t Sx(t) 

which is known as the Euler—Lagrange equation. 


(1.28) 


Leonhard Euler (1707-1783). In the 
words of Pierre-Simon Laplace (1749- 
1827): Read Euler, read Euler, he is 
the master of us all. 


The Lagrangian L is related to the Lagrangian density C by 


L = 



(1.29) 


so that the action S is given by 


S = 


/ 


d tL = 


J d t d x C. 


(1.30) 


The following example introduces the idea of a Lagrangian density, a 
concept we will come back to frequently, but also provides a nice way to 
derive the classical wave equation. 


Example 1.4 


Consider waves on a string of mass m and length i. Let us define the mass density 
p = m/£, tension T and displacement from the equilibrium ?/>( x, t) (see Fig. 1.5). The 
kinetic energy T can then be written as T = 1 dx p(dtp/dt) 2 and the potential 
energy V = ^ dx L~(d'lp/dx) 2 . The action is then 

S[i>{x,t)\ = j d t(T-V) = j dtdx£ (i/>, L. ^) , (1.31) 


where 


C u. 


dip 

1 ht’ 


dip 

dx 


T 

~2 


is the Lagrangian density. We then have immediately 



5S _ dC _ 
Sip dip 

= 0 + T 


d dC 
dx d(dip/dx) 
d 2 ip d 2 ip 
dx 2 ^ dt 2 ’ 


d dC 
dt d{dip/dt) 


and so the wave equation (d 2 ip/dx 2 ) = (1/u 2 )(d 2 ip/dt 2 ) with v = 
almost effortlessly. 


(1.32) 


(1.33) 
VVp emerges 


dx 



Fig. 1.5 Waves on a string. The dis¬ 
placement from equilibrium is -*/>( x , t) 
and the equation of motion can be de¬ 
rived by considering an element of the 
string of length dx and mass pdx. The 
figure shows a short section in the mid¬ 
dle of the string which is assumed to be 
tethered at either end so that -ip(0,t) = 
ip(£, t). 
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3 

Recall from the argument in Sec¬ 
tion 0.4 that the index fi in d^(j) is nat¬ 
urally lowered; see eqn 0.2. 


^Remember that we are using the Ein¬ 
stein summation convention, by which 
twice repeated indices are assumed 
summed. Equation 1.35 is simply 
the four-dimensional generalisation of 
eqns 1.27 and 1.28. 


c 

Fermat’s principle of least time tells us 
something about optical path length of 
a ray, that is the difference between the 
phase at the beginning and end of a ray. 
By analogy with Hamilton’s principle 
of least action of a classical mechanical 
system, one can posit that the action S 
is given by a constant multiplied by the 
phase of a wave function. This defines 
the constant which is given the symbol 
h. Thus we take the phase to be S/h. 


As a final trick, let us put the Euler- Lagrange equation on a fully rela¬ 
tivistic footing, bracing ourselves to use some four-vector notation and 
the Einstein summation convention (see Section 0.4). If the Lagrangian 
density C depends on a function <j>(x) (where a: is a point in spacetime) 
and its derivative 3 then the action S is given by 

S = J d 4 x £((/>, d^). (1.34) 


By analogy with eqn 1.27, the action principle gives 4 


ss dc ( dc 

s<t> 80 ^0(8^)) 


(1.35) 


the four-vector version of the Euler-Lagrange equation. 


Example 1.5 

As a simple example of this, consider the case when £ is given by 

c = ^(c>m0) 2 - (1-36) 

where (d M </>) 2 = (<9 M </>)(c^0). Simple differentiation then gives 

= - m2 4 >, and = 3 **^. (T37) 

d(f) dip^cj)) 

Use of the action principle (eqn 1.35) gives 

S S 

— = = 0 and hence ( d 2 + = 0. (1.38) 


1.5 Why does it work? 

In this chapter, we have considered two variational principles: Fermat’s 
principle of least time and Hamilton’s principle of least action. One 
describes the path taken by a ray of light, the other describes the path 
taken by a classical particle. They are very elegant, but why not stick 
to using Snell’s law and Newton’s law? And why do they both work? 

The answer to both of these questions is quantum mechanics. We will 
talk about this in more detail later in the book, but the motion of a 
particle (photon or billiard ball) going from A to B involves all possible 
paths, the sensible classical ones and completely insane ones. You have 
to sum them all up, but they are each associated with a phase factor, and 
for most sets of paths the different phases mean that the contributions 
cancel out. It is only when the phase is stationary that nearby paths all 
give a non-cancelling contribution. The wave function for a particle has 
a phase factor 5 given by 

e iS /\ 


(1.39) 
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where S = f L dt is the action, so a stationary phase equates to a sta¬ 
tionary action (eqn 1.26). (Running the argument in reverse, the phase 
factor for a photon of energy E is and so stationary phase 

equates to stationary time, which is Fermat’s principle.) We will see 
how this approach leads naturally to Feynman’s path-integral approach 
later in the book (Chapter 23). But for now, notice simply that if the 
action is stationary then the classical path which minimizes the action 
is the one that is observed and everything else cancels. 

Snell’s law and Newton’s law are enough to solve classical systems. 
But neither allow the generalization to quantum systems to be performed 
with ease. Thus, to formulate quantum field theory (the grand task of 
this book), we have to start with a Lagrangian picture. Our next step is 
to gain some insight into what happens in non-classical systems, and so 
in the next chapter we will turn our attention to an archetypal quantum 
system: the simple harmonic oscillator. 


Chapter summary 


• Fermat’s principle of least time states that light takes a path which 
takes the least time. 


The Lagrangian for a classical particle is given by L = T — V. 

Classical mechanics can be formulated using Hamilton’s principle 
of least action. 

<L40) 


where the action S = J Ldt. 

• Both Fermat’s and Hamilton’s principles show how the classical 
paths taken by a photon or massive particle are ones in which the 
phase of the corresponding wave function is stationary. 


Exercises 


(1.1) Use Fermat’s principle of least time to derive Snell’s 
law. 


(1.2) Consider the functionals H[f] = f G(x,y)f(y)dy, 

I[f] = f-i f( x ) dx and J [f] = J dy of the 


function /. Find the functional derivatives 

and 5JU I 
Sf(x 0 )5f(xi) Sf(x)' 


SH[f] 

sm ’ 


(1.3) Consider the functional G[f] = f g(y,f)dy. Show 
that 

$G[f] _ d a( x , f) n , u 

sm 8/ ■ 1 • J 

Now consider the functional H[f\ = J g(y, /, f) dy 
and show that 


SH[f] dg d 8g 

Sf{x) df dx df ’ 


(1.42) 
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where /' = df/dy. For the functional J[f] = 
J g{y, f, /', /") dy show that 

8J[f] = dg d dg d 2 dg 

Sf(x) df da; df da; 2 df" ’ V ’ 


where f" = d 2 f/dy 2 . 
(1.4) Show that 

8(j>(x) 


= S(x - y), 




J d 3 ® (Vi 


and the kinetic energy is 


I d2x {‘ 


Use these results, and the functional derivative ap¬ 
proach, to show that ip obeys the wave equation 

< L48) 

where v is the velocity of the wave. 

(1.6) Show that if Zq[J\ is given by 


(L45) 

(1.5) For a three-dimensional elastic medium, the poten¬ 
tial energy is 


Z 0 [J]=exp j d 4 a;d 4 y J(x)A(x - y)J(y) J , 

(1.49) 

where A(a:) = A(— x) then 


8Z 0 [J] 
8J(z i) 


-[I 


= - \ d*yA(zi-y)J{y) Z 0 [J}. (1.50) 



Simple harmonic oscillators 



2.1 Introduction 

The advent of quantum mechanics convinced people that things that had 
previously been thought of as particles were in fact waves. For example, 
it was found that electrons and neutrons were not simply little rigid 
bits of matter but they obey a wave equation known as the Schrodinger 
equation. This idea is known as first quantization. To summarize: 


2.1 Introduction 

2.2 Mass on a spring 

2.3 A trivial generalization 

2.4 Phonons 
Chapter summary 
Exercises 


19 

19 

23 

25 

27 

27 


First quantization: Particles behave like waves. 


( 2 . 1 ) 


However, this is not the end of the story. It was also realized that 
things that had been previously thought of as waves were in fact par¬ 
ticles. For example, electromagnetic waves and lattice waves were not 
simply periodic undulations of some medium but they could actually 
behave like particles. These phenomena were given particle-like names: 
photons and phonons. This idea is known as second quantization. To 
summarize: 


Second quantization: Waves behave like particles. 


( 2 . 2 ) 


Quite how these ideas link up is one of main themes in quantum field 
theory which sees the underlying reality behind both waves and particles 
as a quantum field. However, before we get to this point, it is worth 
spending some time reviewing second quantization in a bit more detail 
as it is the less familiar idea. In this chapter, we focus on the most 
famous example of a wave phenomenon in physics: the oscillations of a 
mass on a spring. 


2.2 Mass on a spring 

Consider a mass on a spring (as shown in Fig. 2.1), one of the simplest 
physical problems there is. Assume we have mass to, spring constant 
K , the displacement of the mass from equilibrium is given by x and the 
momentum of the mass is given by p = mx. The total energy E is the 
sum of the kinetic energy p 2 /2m and the potential energy \Kx 2 . 

In quantum mechanics, we replace p by the operator —i hd/dx, and 
we then have the Schrodinger equation for a harmonic oscillator 





(_ h 2 d 2 
\ 2 to dx 2 





Eil). 


(2.3) 


Fig. 2.1 A mass m suspended on a 
spring, of spring constant K. 
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Fig. 2.2 Eigenfunctions of the simple 
harmonic oscillator. 

K) : 

- n = 5 

- n = 4 

- n = 3 

-j- n = 2 

hu> 

-1- n = 1 

- n = 0 

. E = 0 


Fig. 2.3 The ladder of energy levels for 
the simple harmonic oscillator. 


The solutions to this equation can be obtained by a somewhat involved 
series-solution method and are given by 

where iJ„(£) is a Hermite polynomial and £ = y/mu/hx. As shown 
in Fig. 2.2, these eigenfunctions look very wave-like. However, they 
do have a ‘particle-like’ quality which is apparent from the eigenvalues, 
which turn out to be 

E n = (n + ^ j hu, (2.5) 

where to = sjK/m. This is a ladder of energy levels (see Fig. 2.3), though 
note when n = 0 the energy is not zero but fou/2. This is called the 
zero-point energy. Because the energy levels form a ladder, you must 
add a quantum of energy hu to move up the ladder by one rung. It is as 
if you are adding a lump, or particle, of energy. Can we make this vague 
feeling any more concrete? Yes we can. Moreover, we can do it quite 
elegantly and without dirtying our hands with Hermite polynomials! 

To accomplish this, we start with the Hamiltonian for the simple har¬ 
monic oscillator written out as follows: 

„ Xp 1 \ 

H = -f- + -mu 2 x 2 , (2.6) 

2 m 2 

where we have expressed the spring constant as K = mu 2 . One feels 
the seductive temptation to factorize this Hamiltonian and write it as 

\mu 2 (x -— V) \ x+ —p] , (2.7) 

2 \ mw ) \ mu ) 


but we hit a problem: in quantum mechanics, the operators x and p do 
not commute. Hence, multiplying out the brackets in eqn 2.7 doesn’t 
quite give us back H but instead produces 


-mu 


x - p 

muj 


x + 


1 2^2 
p l = -mu x 


+ t + Y^ ^ 


where [x,p] = xp — px = ih is the commutator of x and p. The right- 
hand side of eqn 2.8 is then just H — which is nearly H but has 
the correction — Ay due to the zero-point energy being subtracted. The 
factorization can therefore be made to work and we realize that the 
operators x — p and x + ^,p are clearly going to be useful. Since p 
and x are Hermitian, the operators x — and x + -^jP are adjoints 
of each other (and therefore are not themselves Hermitian, so cannot 
correspond to any observable). We will give them the special names a' 
and a, although we will include a multiplicative constant \Jmu/‘ITi in 
our definition so that they have particularly nice properties. Hence we 
will write 



(2.9) 

( 2 . 10 ) 





Example 2.1 

The commutator [d, at] can be evaluated as follows (remembering that [x,p] = i h): 
A+ - mu ( 1 . i 


a, a 1 = 


ofc , -[P. £ ] 

2h V rnuJ ma; 


mu 

~2h 


h 

mu 


h 

mu 


= 1. 


( 2 . 11 ) 


The definitions of a and a' can be inverted to give 


„ . / hm u >, * „ + . 

P = -i y—(a-a 1 )- 


( 2 . 12 ) 

(2.13) 


Putting our new definitions of a and into our equation for the Hamil¬ 
tonian yields 

H = hjj . (2-14) 

The active ingredient in this Hamiltonian is the combination a'a. If 
a* a has an eigenstate | n) with eigenvalue n, then H will also have an 
eigenstate |n) with eigenvalue huj(n+ |), so that we have recovered the 
eigenvalues of a simple harmonic oscillator in eqn 2.5. However, we need 
to prove that n takes the values 0,1,2,.... The first step is to show that 
n > 0. We can do that by noting that 


n = (n|a^a|n) = |a|n)| 2 > 0. 


(2.15) 


Next, we have to show that n takes only integer values, and we will 
do that below but beforehand let us introduce a bit of notation to save 
some writing. We define the number operator h by 

n = a 1 a, (2-16) 

and hence write 

n\n) = n\n). (2-17) 

Number of what? The quantity n labels the energy level on the ladder 
(see Fig. 2.3) that the system has reached, or equivalently the number 
of quanta (each of energy hio) that must have been added to the system 
when it was in its ground state. We can rewrite the Hamiltonian as 

H = Hco , (2-18) 

and therefore 

H\n) = (n + Tujj\n), (2-19) 

so that | n) is also an eigenstate of the Hamiltonian. Thus | n) is a 
convenient shorthand for the more complicated form of V’n(C) shown in 
eqn 2.4. The next examples show that the eigenvalue n indeed takes 
integer values. 
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Example 2.2 

This example looks at the property of the state defined by a)\n). One of the things 
we can do is to operate on this with the number operator: 

ha)\n) = d^dd^|n). (2.20) 

Using the commutator in eqn 2.11 gives aa * = 1 + a' a and hence 

ha)\n) = (n + l)a^|n). (2.21) 


The above example shows that the state a^\n) is an eigenstate of H but 
with an eigenvalue one higher than the state | n). In other words, the 
operator a t has the effect of adding one quantum of energy. For this 
reason is called a raising operator. 


Example 2.3 

This example looks at the property of the state defined by d|n). We can operate on 
this with the number operator: 

nd|n) = d^dd|n). (2.22) 

Using the commutator in eqn 2.11 gives a'a = aa' — 1 and hence 

nd|n) = (n — l)d|n). (2.23) 


The above example shows that the state a\n) is an eigenstate of H but 
with an eigenvalue one lower than the state |n). In other words, the 
operator a has the effect of subtracting one quantum of energy. For this 
reason a is called a lowering operator. 


Example 2.4 

Question: Normalize the operators a and dJ. 

Solution: We have shown that d|n) = k\n — 1), where A; is a constant. Hence, taking 
the norm of this state (i.e. premultiplying it by its adjoint) gives 

|d|n)| 2 = (n|d^d|n) = |/c| 2 (n — 1| n — 1) = \k\ 2 , (2.24) 

where the last equality is because the simple harmonic oscillator states are normalized 
(so that (n — l|n — 1) = 1). However, we notice that d^d = h is the number operator 
and hence 

(n|d'd|n) = (n|fi|n) = n. (2.25) 

Equations 2.24 and 2.25 give k = y r n. (This assumes k to be real, but any state 
contains an arbitrary phase factor, so we are free to choose k to be real.) 

In the same way, we have shown that a'\n) = c\n + 1), where c is a constant. 
Hence, 

|d^|n)| 2 = (n|dd^|n) = |c| 2 (n + l|n + 1) = |c| 2 , (2.26) 

and using aa' = l + dtd = 1 + n, we have that 

(n|dd^|n) = (n|l + n|n) = n + 1, (2.27) 

and hence c = \Jn + 1 (choosing c to be real). In summary, our results are: 

d|n) = v^ln-l), (2.28) 

d f |n) = \/rT+T|n+l). (2.29) 
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If we keep hitting the state |n) with a, we eventually get to |0), the 
ground state of the simple harmonic oscillator. At this point, we will 
annihilate the state completely with a further application of a because 
a|0) = 0. Notice that 


H |0) = Huj 



| 0 ) 




(2.30) 


so this really is the ground state and we see that the energy is ^Hw, the 
zero-point energy. 1 

It is also possible to go the other way, operating on |0) with the raising 
operator This leads to 


1 To recap, we can write 

|a|n )| 2 = (n|ata|n) 

, , H 1 

= n --- n) 

FujJ 2 

_ E n 1 

huj 2 ’ 

where E n = (n + ^)hu>. Thus the con¬ 
dition a|0) = 0 implies ^ = 0, and 
hence the energy of the ground state is 
given by Eq = \EjJ. 


d f |0) = |1), (2.31) 

at|l) = \/2|2) =► |2)= ( ^|0), (2.32) 

dt| 2 ) = V3|3) =► 13) = -4==|0), (2.33) 

V3 x 2 

and in general 

(a'Y 

|n) = t_^| 0 ), (2.34) 

Vnl 

so the state | n) can be obtained by repeated application of the operator 
at This leads to a completely different way of thinking about these 
new operators: we call a' a creation operator and a an annihilation 
operator. We imagine a) acting to create a quantum of energy Hu and 
move the oscillator up one rung of the ladder. Its adjoint, a, acts to 
annihilate a quantum of energy Hu and move the oscillator down one 
rung of the ladder (see Fig. 2.4). These quanta of energy behave like 
particles; we are adding and subtracting particles by the application of 
these operators and we have realized the dream of second quantization. 
A wave problem has spontaneously produced particles! 


2.3 A trivial generalization 




at' 


a 

at 


a 

at ' 


a 

at ‘ 


a 

at ‘ 


a 



a 


•A 

V 


't \ N 


|5> 

|4> 

|3) 

|2) 

|1> 

|0> 


E = 0 


Fig. 2.4 The ladder of energy levels for 
the simple harmonic oscillator. The op¬ 
erator a t moves the oscillator up one 
rung of the ladder. Its adjoint, a , acts 
to move the oscillator down one rung 
of the ladder. Note that a|0) = 0, so 
dropping off the bottom of the ladder 
results in zero. 


The next thing we can do is a completely trivial generalization of what 
we have done before, but it is worth thinking about because certain 
complicated problems reduce to it. Consider a set of N uncoupled simple 
harmonic oscillators. They are uncoupled, so one could be on your desk, 
another sitting in your bathroom, another one out in the park. They 
don’t talk to each other, influence each other or affect each other in 
any way. Nevertheless, just for fun, we are going to consider their joint 
Hamiltonian H which is simply the sum of individual Hamiltonians H k 
where k runs from 1 to N. Hence 


N 

h = Y. h ^ 


k =1 


(2.35) 
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H k = ^ + ^m k ulx 2 k . (2.36) 

So far, nothing complicated. We now define the operator a' k which cre¬ 
ates a quantum of energy in the /cth oscillator (and leaves all the others 
unaffected). We also define the operator a k which annihilates a quan¬ 
tum of energy in the fctli oscillator (and leaves all the others unaffected). 
Acting with an operator a k only affects the number of quanta for the 
fcth oscillator. We can therefore write 


a\\ni,n 2 , ...,n r ,...) oc |ni,n 2 , —,n k + 1,...), 
dfe|ni,n 2 , ...,n k ,...) a |ni,n 2 , —,n k - 1,...). 

The operators will have the commutation rules: 

[a k ,a q = 0, 

= °- 

[a k ,al\ = 6 kq . 


(2.37) 


(2.38) 

(2.39) 

(2.40) 


In eqn 2.40, the point is that operators acting on different oscillators 
commute (and therefore do not affect each other). Hence we can follow 
the results of the previous section and write the Hamiltonian in eqn 2.35 
as 

N / 

H = f a\a k + - j . (2.41) 

A--1 ' ' 

Again we have to define a vacuum state |0, 0, 0, 0, 0,...) (usually just 
called |0», such that 

a fc |0)=0, (2.42) 

for all k. This is the state in which every one of the oscillators is in its 
ground state. 

A general state of the system, written as \ni, n 2 , • • • , tin) is known as 
the occupation number representation. Using the same techniques 
as in Section 2.2, we can write this general state as 


,n N ) = ] =(a t 1 ) rai (a|) n2 ---(aj v ) nw |0,0,---0). 

\Zni\n2i ■ ■ ■ un- 

(2.43) 

The idea here is that we are operating on the vacuum state with a 
product of creation operators necessary to put n\ quanta of energy into 
oscillator number 1, n 2 quanta of energy into oscillator number 2, etc. 
We can express this even more succinctly using the following notation 


iK}> = n^(4n°>, 

k V n fc 


(2.44) 


where |0) is the vacuum state, as explained above. 

In this model, all of the oscillators have been completely indepen¬ 
dent. It is now time to tackle a more challenging problem, coupling the 
oscillators together, and that is covered in the next section. 
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2.4 Phonons 


Consider a linear chain of N atoms (see Fig. 2.5), each of mass m, and 
connected by springs of unstretched length a and which have spring 
constant K. The masses are normally at position Rj = ja, but can be 
displaced slightly by an amount Xj. The momentum of the jth mass is 
Pj. The Hamiltonian for this system is 

^ = E£ + ^+i-^) 2 - ( 2 - 45 ) 

3 

In contrast to the model in the previous section, we are now dealing 
with a coupled problem. Each mass is strongly coupled to its neighbour 
by the springs and there is no way in which we can consider them as 
independent. However, we will show that the excitations in this system 
behave exactly as a set of totally independent oscillators. This works 
because we can Fourier transform the problem, so that even though 
the masses are coupled in real space, the excitations are uncoupled in 
reciprocal space. How this is done is covered in the following example. 



I 

t 

Rj = j a, 


Fig. 2.5 A linear chain of atoms. 


Example 2.5 

We begin by Fourier transforming both Xj and pj , by writing 



Xj 


(2.46) 


pj 


(2.47) 

and equivalently of course 





Xk 

(2.48) 


Pk 

3 

(2.49) 

We impose periodic boundary conditions 2 forcing e lfej ' a = e lk U+ N ) a The wave vector 

k therefore takes the values 

2mn/Na , where m is an integer in the range 

—N/2 < 

m < N/2. Note that 






E eiiio = N8 k ,o- 

(2.50) 

The commutation relations 

are 

3 



= 

m jj’’ 

(2.54) 


= 

^EE c_ifcj ' 0e " ifc,/<I [wy] 

7 i' 



= 

j j 

lh 0 -i(k+k')ja 

N ^ 

J 



= 

i fiSfc . 

(2.55) 


We can now work out terms in the Hamiltonian, so that 



= ^EEE™e i(fc+fc ' )ja . (2.56) 

3 k k' 


For the time being, we will drop the 
‘hats’ on Xj and pj to save cluttering 
the algebra. We note that there is an¬ 
other way of solving this problem which 
involves writing the operators Xj and 
Pj in terms of creation and annihila¬ 
tion operators of particular modes. It 
is arguably a more elegant route to the 
answer and will generalize more easily 
to additional cases, but as it is a little 
more abstract we will save it for later 
in the book. 

o 

Periodic boundary conditions are de¬ 
scribed in detail in the next chapter. 


The second quantization trick 

to evaluate sums of the form 

^EE PkP q e i(k+q)ja ■ (2.51) 

3 kq 

Step 1: Perform the spatial 

sum, making use of the identity 
Ej e 1 ( fc-fc/ )- 7a = AT^fc/. This gives 

E PkPqh-q • (2.52) 

pq 

Step 2: Use the Kronecker delta to do 
one of the momentum sums. This has 
the effect of setting q = — k, leaving us 
with a sum over a single index: 

E PkP-k■ (2.53) 

k 
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q 

Notational point: For clarity, we will 
now omit the tilde from xk and pk , but 
to remind ourselves that they are op¬ 
erators we will reinstate the ‘hats’, so 
they become Xk and pk- 


4 • 


U, - 


l.e. CLk a k — a k a k 


= 1. 


This combination of sums over one spatial and two momentum indices will occur 
frequently and is easily dealt with using the trick shown in the box. The result is 

X)p| = X)pfcp-fc- (2.57) 

j k 


The other term in the Hamiltonian may be treated similarly and we have 


X^j+i - x ^ 2 

j 


vEEE 

j k k' 

^2x k x_k ^4sin 2 ^) , 


l)( e ifc'a 


1) 


(2.58) 


where use has been made of the identity 1 — cos ka = 2 sin 2 . We can thus express 3 
the Hamiltonian as 

* = E 

k 

where cj 2 = (AK/m) sin 2 (fca/2). 

With our definitions we automatically have that pj. = p—k and x^. = X—k (which 
follow from the requirement that pj and Xj are Hermitian), so we can write down 
the commutation relation as [x k ,p' k ,] = i fabkk' an d we can a l so write down creation 
and annihilation operators as follows: 


—PkP-k + -mw k x k X- k 

2m 2 


(2.59) 


a k — 


4 = 


mujk 
2 h 

mujk 

2 h 


Xk • 


mujk 

i 


-Pk 


Xk Pk 

mujk 


(2.60) 

(2.61) 


which have commutation relations [a 
can invert eqns 2.60 and 2.61 to yield 


= [o-k^k f ] = 0 and [a k ,a\ f ] = 8 k y. We 


Xk = 


/5^( afc + a E)’ 


Pk = -i 

The Hamiltonian becomes 


mhjJk 


k I • 


& = E ( afc 4 + . 


_ ^ twk_ (a k a\ + a\a k 


and reindexing this gives 

rUjJ k 

~k 2 

Using the commutator 4 then produces 

N / 

& =e ^ ( a i afc + o ) • 


(2.62) 

(2.63) 

(2.64) 

(2.65) 

( 2 . 66 ) 


This example has demonstrated that the Hamiltonian for a set of coupled 
masses on a chain can be expressed in terms of a sum over modes, 
labelled by wave vector k. Comparing this result (eqn 2.66) with that 
obtained at the end of the previous section (eqn 2.44), one sees that 
they are identical. Thus these modes behave as if they are entirely 
independent and uncoupled simple harmonic oscillators. We call these 
modes phonons, and each phonon mode, labelled by k, can be given 
integer multiples of the quantum of energy hio k - 
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This is the key point. Each phonon mode behaves like a simple har¬ 
monic oscillator and so can accept energy in an integer number of lumps, 
each lump being of size Hu>k- This is because the energy eigenvalues of a 
simple harmonic oscillator form a ladder of energy levels where the rung- 
size is fixed. These lumps of energy look like particles, and so we think 
of phonons themselves as particles. Later on, we play exactly the same 
trick with the electromagnetic field and show that it behaves like a set 
of uncoupled simple harmonic oscillators and call the quanta photons. 
This is the heart of second quantization: wave problems have oscillator 
solutions and hence produce particles! The key insight in this chapter 
is essentially that the oscillator picture of physical systems leads to a 
particle picture of those systems. This notion is summarized in Fig. 2.6. 


Simple harmonic oscillator 
in the nth level 
E n = (n + hu k 

OSCILLATOR PICTURE 


n particles of energy 

PARTICLE PICTURE 


Chapter summary Fig. 2.6 The equivalence of the oscil¬ 

lator and particle pictures. 

• First quantization shows that particles behave like waves; second 
quantization shows that waves behave like particles. 

• The simple harmonic oscillator has energy eigenvalues given by 

E n = (n + Eigenstates can be written in the occupation 

number representation. 

• Using creation and annihilation operators the Hamiltonian for the 
simple harmonic oscillator can be written H = huj(a*a + |). 

• The phonon problem can be re-expressed as a sum over non¬ 
interacting modes, each one of which behaves like a simple har¬ 
monic oscillator. 


Exercises 


(2.1) For the one-dimensional harmonic oscillator, show 
that with creation and annihilation operators de¬ 
fined as in eqns 2.9 and 2.10, [a, a] = 0, [o' , o'] = 0, 
[a, <U] = 1 and H = hu)(a}a + |). 

(2.2) For the Hamiltonian H = + A mui 2 x 2 + A* 4 , 

where A is small, show by writing the Hamiltonian 
in terms of creation and annihilation operators and 
using perturbation theory, that the energy eigenval¬ 
ues of all the levels are given by 

En= ( n+ ^) ftw+ T (i) ( Zn+2n+ 1 ). 

(2.67) 


(2.3) Use eqns 2.46 and 2.62 to show that 

^ 1 { h \ V ' 1 i kja , t —ifcjai 

* , = 7vUJ?cVF il “‘ e + “‘ e '■ 

( 2 . 68 ) 

(2.4) Using a|0) =0 and eqns 2.9 and 2.10 together with 
(x\p\i/j) = —ih^(x \ip), show that 


and hence 


h d i 
mu) da ;) 

I (®|o), 

(2.69) 

1/4 

— mu) 

e 

x 2 /2h 

(2.70) 
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1 Satyendra Nath Bose (1894-1974) 
wrote a paper in 1924 which laid the 
foundations for quantum statistics. 

2 

In addition to his many contribu¬ 
tions to other branches of physics, En¬ 
rico Fermi (1901-1954) made numerous 
contributions to quantum field theory. 
His 1932 review of Quantum Electro¬ 
dynamics (Rev. Mod. Phys. 4, 87) had 
enormous influence on the field. 

3 A consequence of this principle is that 
fermions obey the Pauli exclusion prin¬ 
ciple: It is impossible to put two in the 
same quantum state. This affects the 
statistical distribution among a set of 
discrete energy levels of these indistin¬ 
guishable quantum particles, and so is 
referred to as quantum statistics. In 
Sections 3.1—3.3 we won’t worry about 
quantum statistics, but return to the 
issue in Section 3.4. 


I am not a number. I am a free man. 

Patrick McGoolian in The Prisoner 

In the previous chapter we considered simple harmonic oscillators, 
showed that such oscillators have particle-like eigenfunctions, and in¬ 
troduced en passant the occupation number representation to describe 
sets of uncoupled oscillators. Now we are going to make further use of 
this idea to describe systems of many particles more generally. The use 
of the occupation number representation to describe a system of identi¬ 
cal particles is partly a matter of notational administration and partly 
a radical change in the way we look at the world. We (i) change the 
way we label our states (this is administration) and (ii) get rid of wave 
functions altogether (this is the radical part). We replace wave func¬ 
tions with a version of the creation and annihilation operators we used 
to solve the quantum oscillator problems. This turns out to simplify 
things enormously. In setting up this new method we will need to en¬ 
sure that we never violate the sacred laws of indistinguishable particles. 
These are as follows: 

• There are two types of particle in the Universe: Bose 1 particles 
and Fermi 2 particles. 

• If you exchange two identical bosons you get the same state again. 

• If you exchange two fermions, you get minus the state you started 
with. 3 

To make things simple we’ll confine our particles to a box for most of 
this chapter. Although this makes the notation ugly (and non-covariant) 
it has several advantages (not least that we can normalize the states in 
a simple way). We recap the physics of particle in a box in the next 
section. 


3.1 A particle in a box 

We will now choose units equivalent to setting h = 1. Therefore the 
momentum operator for motion along the ^-direction p can be written 

. d 

P = - 1 W 


(3.1) 
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Solutions to the Schrodinger equation for a particle in a box of size 
L are running waves ip(x) = ^je lpx , and eigenstates of the momentum 

operator \ftip(x) = — i^ = pip(x)\. We use periodic boundary conditions 
which force us to have the same amplitude for our wave at equivalent 
positions of each box, that is ip(x) = tp(x+L). Therefore e lpx = e lp ( x+L \ 
and hence e lpL = 1. In order to satisfy this condition, we need pL = 
27r?n, with m an integer. This imposes a quantization condition on the 
possible momentum states that particles in the box can take. They must 
satisfy 


Pm — 


2nm 

~L~ 


(3.2) 


We’ll call the possible momentum states things like pi and P 2 , etc. The 
energy of a single particle in a state p rn is E Pm , which depends on the 
momentum state, but it doesn’t matter how yet. 

To make a simple many-particle system we’ll put more and more non¬ 
interacting Bose particles in the box. How does this affect the total 
momentum and energy of the system? Applying the momentum opera¬ 
tor to the two-particle state (see Fig. 3.1) yields 


P\PlP2) = (Pi +P2)\PlP2), 


(3.3) 


and applying the Hamiltonian operator gives us 

H\pipz) = (. E P1 + E P2 )\pip 2 ). (3.4) 


Since the particles don’t interact, having two particles in a particular 
energy state (p^ say) costs an energy 2 E P3 , i.e. double the single-particle 
energy. The total energy is given by Y^ m n Pm E Vmi where n Pm is the 
total number of particles in the state | p m ). 


n p 
3 - 
2 - 
1 - 


P 2 


~P 


Fig. 3.1 A two-particle state given by 
|P1P2>. 


3.2 Changing the notation 


As advertised, the first thing we’ll do is change the notation. In conven¬ 
tional quantum mechanics we label each identical particle (e.g. particle 
A, particle B, particle C) and list its momentum. This gives rise to states 
labelled like |ABC) = \p 1 P 2 P 1 ) for a three-particle state. We read this 
as ‘particle A in momentum state p \, particle B in state p 2 and particle 
C in state p\ . This is shown in Fig. 3.2 How could we streamline this 
notation? We haven’t yet used the facts that the particles are indistin¬ 
guishable (making the assignment A, B, C, etc. rather meaningless) and 
that there are only certain allowable values of momentum (i.e. 2Tvm/L). 
We can therefore decide that instead of listing which particle (A, B, C, 
etc.) is in which state p m , we could just list the values of the momentum 
p m , saying how many particles occupy each momentum state. For our 
example, we’d say ‘two particles are in momentum state p\, one is in 
momentum state p 2 . We therefore define a state of N particles by list¬ 
ing how many are in each of the momentum states. We will write these 
as \nin 2 nz ...). Our example is written 12100 ...). Specifying a state by 


n p 
3 - 
2 - 
1 - 


C. 

A. 

Pi 


B. 


P2 


-P 


Fig. 3.2 A three-particle state given 
by \pip 2 P 1 ) can be written as 12100 • • •) 
in the occupation number representa¬ 
tion. 
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Particles 

old 

new 

0 

|0> 

|00> 

1 

ki) 

|10) 

1 

92 > 

|01> 

2 

ki<?i) 

|20> 

2 


|H> 

2 

l<?2<?2> 

102} 

3 

kl9l9l) 

|30> 

and so on. 




Table of old and new notations. 


listing the number of identical particles in each quantum state is another 
instance of the occupation number representation, which we met 
in the last chapter. 


Example 3.1 

Let’s compare the old and new notation for a system with only two possible states: 
pi = 2n/L and p 2 = An/L. Into these states we can put any number of Bose particles 
that we choose. The table in the margin compares the old and new notations. 


What happens when we act on a state in occupation number represen¬ 
tation with the Hamiltonian? As we saw above, the answer needs to 
be 


H\nin 2 n 3 ...} = 


E' 


l Pm Ep n 


\nin 2 n 3 ...), 


(3.5) 


that is, we multiply the number of particles in each state by the energy 
of that state and sum over all of the states. This is obviously the right 
thing to do to find the total energy. 

Now for the reason why occupation numbers are useful. Equation 3.5 
can be made to look quite similar to something we’ve seen before. Recall 
that a quantum harmonic oscillator had a total energy given by E = 
(n + |) huj with n being some integer. Dropping the zero-point energy 
(which just amounts to a new choice in our zero of energy) we can 
write this as E = nfno. In words, the oscillator has n quanta in it. 
Crucially, the energy levels are equally spaced, so having two quanta 
in the system gives an energy which is twice that for a single quantum 
in the system. Now imagine N independent oscillators labelled by the 
index k. These might not be identical, so we say that the energy level 
spacing of the kth oscillator is hbj k . The total energy of this system 
is now E = n kfuA>k- So the fcth oscillator has nk quanta in it, 

contributing an energy npioJk to the sum. Compare this to our system 
of identical particles for which the total energy is E = y. Prn n p m E Pm - 
In words, this says that the momentum state p m has n Pm particles in 
it. Each particle contributes an energy E Pm giving a contribution of 
n Pm E Pr , for the momentum mode p rn . These two systems therefore 
have the same energy level structure. 

We have built an analogy between two completely different systems: 
harmonic oscillators and identical particles. 


1 We should be a little careful here; 
we haven’t physically replaced par¬ 
ticles with pendulums. We’ve sim¬ 
ply exploited the fact that the sim¬ 
ple harmonic oscillator energy levels are 
equally spaced, so, for example, two 
quanta in an oscillator have an energy 
that is twice that of a single quantum 
in that oscillator. 


SHO 


Identical particles 


Quanta in oscillators 
fcth oscillator 
E = E fc=1 n k huj k 


Particles in momentum states 
mth momentum mode p m 
E = T N , n v E v . 

Z—jm= 1 Pm Pm 


(3.6) 


As we’ll see in the next section, the real value to this analogy is that it 
can be pushed further. 4 
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3.3 Replace state labels with operators 


The notation change to occupation number representation is just win¬ 
dow dressing. We’re now going to make a more dramatic change in an 
attempt to almost remove state vectors altogether! We’ll do this by hav¬ 
ing operators describing the physics rather than state vectors. We’ll have 
to retain only one special state, the vacuum |0). Following eqn 2.43, 5 
we will build up a general state of several harmonic oscillators by acting 
on the vacuum state |0) and write 


\nW2---) = 

k 


1 

(rife!)5 


(4) nfc |o>- 


(3.7) 


c 

For the oscillator problem in Chap¬ 
ter 2 we had the very useful technol¬ 
ogy of creation and annihilation oper¬ 
ators to add and remove quanta from 
the SHO system, and we use the same 
idea here. 


Example 3.2 


We can build up a state with quanta arranged among various oscillators. A state of 
the system in which there are two quanta in oscillator number 1 and one quantum 
in oscillator number 2 (written in occupation number representation as |21000...)) 
can be written as 


| 21000 ...) 


1 

.Til 



Lvn 


|o>. 


(3.8) 


However, what we are doing here is not just talking about creating 
quanta in oscillators, but also particles in particular momentum states. 
We want a creation operator a' Pm to create one particle in a momentum 
state | Pm). It’s hopefully now just a matter of changing k —> p m to go 
from operators describing oscillators to operators describing particles in 
momentum states and we say that a' Pm creates a particle in momentum 
state \p m ) rather than a quantum in oscillator k. Before we can do this, 
we need to check that doing this doesn’t violate any of the symmetry 
principles we stated at the start of this chapter. That is, we need to 
know how this new formulation deals with Bose and Fermi particles. 


3.4 Indistinguishability and symmetry 

We now have to make sure that introducing creation and annihilation of 
states will respect the exchange symmetry principles for identical par¬ 
ticles. It will turn out that the basic technology is supplied by the 
commutation relations of the creation a£ m and annihilation a Pm opera¬ 
tors for a particular state. The key point is that it matters which order 
you use these operators to put particles into states. 


Example 3.3 

Consider a system with two momentum states pi and p 2 described in the occupation 
number representation as n \ 712 ) formed from creation operators acting on a vacuum 
state 10). We’ll define 


4il°> = |!0), 


4a|0> = I 01 )' 


(3.9) 



32 Occupation number representation 


Now let’s add another particle in the unoccupied state. This gives us 

al 2 ah |0) oc |11), “pi 10) oc 111}■ (3.10) 

The proportionality sign is because the constant remains to be determined. 


Whether you first put one particle in state pi and then another in state 
P 2 , or do the same thing in the reverse order, you must end up with the 
same final state: 111}. This means that 


a) pi a) p2 = A al 2 a f pi , 


(3.11) 


6 In Chapter 29 we examine the possi- where A is a constant. We assume two possibilities, 6 A = ±1, and they 
bility of other choices. correspond to wave functions which are either symmetric or antisym¬ 

metric under particle exchange. The particles are then called bosons 
or fermions and we will find that the two cases correspond to two dif¬ 
ferent types of commutation relation for the creation and annihilation 
operators. We will examine each case in turn. 

Case I: A = 1, bosons: In this case 




(3.12) 


Rearranging (and generalizing the labels of the momentum states to i 
and j) we find 

which means that the creation operators for different particle states com¬ 
mute. We also have [a*, aj\ = 0 and, importantly, we define 


= a\a)~ ojat = 0, 


(3.13) 


&i,a t 


— Sij, 


(3.14) 


so that we can use our harmonic oscillator analogy. We conclude that 
these commutation relations are the same as those for simple harmonic 
oscillators above. We’ve succeeding in developing a system to describe 
many particles! The formalism is identical to that we developed for 
many independent oscillators in the previous section. We can therefore 
build up a general state of many particles in momentum states using 

\nin 2 ■ ■ ■) = 7—~~T 7 T ) Hpm I 0 )- (3-15) 

(n„ 2 

m v '"Pm > 


The particles this formalism describes are Bose particles: we can put any 
number into quantum states and, as we’ll see later, they are symmetric 
upon exchange of particles. 

Finally we note that the commutation of different operators implies 
that 

4Al°> = a L a pil°) = (3-!6) 

that is, it doesn’t matter which order you put particles in the states, 
you get exactly the same in either case. 



3.4 Indistinguishability and symmetry 33 


In general we can write 

aj|«i •••«;•••) = VVii + llni-'-ni + l---), (3.17) 

«i|ni • • • rii ■ ■ ■) = y/rh\ni ■ ■ ■ rii - 1 • • •). (3.18) 

Case II: A = — 1, fermions: In this case we’ll write the fermion cre¬ 
ation operators c\ to distinguish them from the boson operators (written 
a\). Plugging in A = —1, we find that 

{4,4 } = ctct + c)c\ = 0, (3.19) 

where the curly brackets indicate an anticommutator. Fermion oper¬ 
ators anticommute: when you change their order you pick up a minus 
sign since cjcj = — qjcj. In particular, setting i = j we find 

ctct +c\c\ = 0 and hence cJcJ = 0. (3.20) 


In other words, trying to shoehorn two fermions into the same state an¬ 
nihilates it completely. This is the Pauli exclusion principle' which 
means that each state can either be unoccupied or contain a single 
fermion. We also have 

{Ci,dj} = 0. (3.21) 

Finally we define an anticommutator 

{cj,ct| = 5ij, (3.22) 

which enables us to use the simple harmonic oscillator analogy here too. 
Again we can describe putting particles in momentum states in the same 
way as putting quanta in independent oscillators. The only difference is 
that we use anticommutators rather than commutators. 

Finally we note that since cjc]|0) = — cj-cj|0), it really does matter 
which order you put the particles into the states. 


7 

The Pauli exclusion principle is named 
after Wolfgang Pauli (1900-1958), the 
Austrian theoretical physicist who was 
one of the pioneers of quantum mechan¬ 
ics, and also of quantum field theory 
and quantum electrodynamics, which 
were his main concerns after 1927. He 
referred to fellow-physicist Wolfgang 
Paul (1913-1993) as his ‘real part’. 
Pauli was merciless with his criticism of 
other physicists and their work, famous 
for his put-downs, one of the most well- 
known of which was that a particular 
scientific paper was ‘not even wrong’. 


Example 3.4 

Let’s check that the number operator still works for fermions. The number operator 
is hi = cjdi. We’ll see the action of hi on a state 111) = cjc^O). We expect that this 
is an eigenstate with eigenvalue 1 if all goes to plan because there is one particle in 
the i = 1 state. We’ll now proceed to check this: 


?n|n) = cjcil 11 ) 

= cjci(cjcj|0)) 

= cj(i — c|ci)c^|°> 
= cfc+IO) = 111), 


(using the anticommutation relation) 


(3.23) 


i.e. 111) is an eigenstate of h\ with eigenvalue 1 as we expected. 
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For fermions, in order to get the signs right, the normalization of creation 
and annihilation operators are defined as follows 

(-l) E Vl - rii\ni ■ ■ ■ rii + 1 • • •), (3.24) 

(-l) Si V / ^l n i--‘ ri i-I"-), (3-25) 

(_l)(«i+«2+-+n<_i)_ (3.26) 

This gives us a factor of —1 for every particle standing to the left of the 
state labelled n,; in the state vector. 


The square roots are redundant in 
eqns 3.24 and 3.25 since rii = 0 or 1, 
so they can be omitted if desired. 


where 


4 hi • 


h hi • 

■■«*■■■) 


(-i) 


Example 3.5 

Let’s check to see we’ve got it right when we come to exchange the particles. Remem¬ 
ber that for bosons we get the same state upon exchange and for fermions we pick 
up a minus sign. To exchange particles we need to imagine a sequence of operations 
which swap particles around. An example would be 


|110> 

a state with particles in states 1 and 2 

(3.27) 

|101> 

move particle from state 2 to 3 


|011> 

move particle from state 1 to 2 


1110} 

move particle from state 3 to 1. 



To move a particle we annihilate it from its original state and then create it in its 
new state. Moving a particle from state 2 to 3 is achieved using 110) The full 
sequence of events can be described with the string of operators 

a(a 3 a^aiaja2|110> = ±|110), (3.28) 

where we want the + sign for bosons and the — sign for fermions. 

Let’s check that the commutation relations give the correct results for swapping 
Bose particles. We can swap operators that refer to different states (since [d*, dj] = 0) 
to get 

aldad^didJ^lllO) = dsdJdjdid^a^lllO). (3.29) 

But ajdi is just the number operator hi, counting the number of states in a mode i. 

a 3 ajnin 2 |110) = (1) x (1) x d 3 dj|110>, (3.30) 

we need to turn a^a'^ into a number operator. Swap the order with the commutators 
to get 

(1 + aja 3 )|110) = (1 + n 3 )|110>, (3.31) 

and notice that n 3 |110) == 0 to obtain (1 + dgd 3 )|110) = 1110), as expected. 

Let’s now check that the commutation relations give the correct results for swap¬ 
ping Fermi particles. Now when we swap any two fermion operators we pick up a 
minus sign. Carrying out the same steps as before 

c{a 3 C2ClC3C2|110) = —C 3 C3c{ciC2C2|110) (3.32) 

= -(l-4cs)|110> 

= — I no) - 

Exchanging the particles gives us the required minus sign. 
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3.5 The continuum limit 

So far our particles have been confined in a box, but we will frequently 
wish to work in the continuum limit of the formalism. As the box size 
increases, the momentum states become more closely spaced until, in the 
limit, the momentum becomes a continuous variable. The Kronecker 6 t j 
functions we’ve been using become Dirac S^ip) functions and the sums 
become integrals. We have, for the continuous case, that 

[a p ,a\] = 5 {3) (p-q), 

and for energies 

H = J A 3 pE p a\,a p . 

Now that we have a working theory where the commutation properties 
of the operators contain all of the information about the states, let’s 
check that it works and gets the correct rules for inner products and for 
position space wave functions. 


(3.33) 

(3.34) 


Example 3.6 

For single-particle states, we 

have 



<p|p'> = (0|dp(d,|0). 

(3.35) 

We use the commutation relations to get 


(p\p') 

= (01 [<5 (3) (p-p') ± dt ( ap] |0) 

= 5( 3 \p— p'), 

(3.36) 


which is the right answer. We can check this by working out a position space wave 
function. We’ll start by making a change of basis 8 \x ) = f d 3 q<p* (x)\q), which in¬ 
volves expanding a position state in terms of momentum states. Using this expansion, 
we obtain 

(x\p) = J d 3 qcj) q (x)(q\p) = (j> p (x). (3.37) 

This is clearly okay, but rather trivial. More interesting is the case of a two-particle 
state 

(p'q'\qp) = (0|a p /a q /atat |0). (3.38) 

Commuting, we obtain 

(p'q'\qp) = s (3 \p' - p)S ( - 3 ' , (q' - q) ± <5 (3) (p' - q)<5 (3, (q' - p), (3.39) 

where we take the plus sign for bosons and minus sign for fermions. Again we can 
work out the position space wave function by making a change of basis \xy) = 
-^= f d 3 p'd 3 q' </>*, (£c)0*/ (y)\p'q') (where the factor of -^= is needed to prevent the 
double counting that results from the unrestricted sums). This gives us 

J d 3 p'd 3 q'<t> p ,(x)<t> q Ay)(p'q'\pq) = T [<t>p{x)(f> q {y) ± <t> q (x)<i>p{y)\ , (3.40) 

which is the well-known expression for two-particle states. 


8 The state \x) can be written in a new 
basis by inserting the resolution of the 
identity in the form 

1 = J d 3 q\q)(q\. 

so that 

I*) = J d3 q\q)(q\ x ), 

and writing (x\q) = (f)q(x) [and hence 
(q\x) = <f> q (x)) yields 

\x) = j d 3 qtf>*(x)\q). 


It works. We have successfully reformulated quantum mechanics. The 
effort we’ve spent here writing everything in terms of creation and anni¬ 
hilation operators will soon pay off. The machinery they represent is so 
useful that we’ll try to build the quantum fields we’re searching for out 
of these objects. As you might suspect, this search will be successful. 
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Chapter summary 

• The occupation number representation describes states by listing 
the number of identical particles in each quantum state. 

• We focus on the vacuum state |0) and then construct many-particle 
states by acting on | 0 ) with creation operators. 

• To obey the symmetries of many-particle mechanics, bosons are 
described by commuting operators and fermions are described by 
anticommuting operators. 


Exercises 


(3.1) For boson operators satisfying 

= Spq , (3.41) 

show that 

1 £V(P—*■») [a p ,4] =5 ^(x-y), (3.42) 

pq 

where V is the volume of space over which the sys¬ 
tem is defined. Repeat this for fermion commuta¬ 
tion operators. 

(3.2) Show that for the simple harmonic oscillator: 

(a) [a,(« t r]=n(a t r- 1 , 

(b) ( 0 |a”(a t ) m | 0 ) = n\5nm, 

(c) (m|a f |n) = sjn + 15 m , n +i, 

(d) (m|a|n) = ^/nS m ,n- 1 - 

(3.3) The three-dimensional harmonic oscillator is de¬ 
scribed by the Hamiltonian 

H= ^(tfi+pl+pl) + \muj 2 (xl+xl+xl). (3.43) 
Zm Z 

Define creation operators a{, a\ and S 3 so that 
[Si, a]] = 8ij and show that H = Juo 5Di = 1 (|+S|Si). 
Angular momentum can be defined as 

L 1 = —ihe xjk a^ak, (3-44) 


so that for example Z, 3 = —ih(a\a 2 — St,Si). Define 
new creation operators 


b\ 

= =(a{+ia\), 


£t 

°o 

= 


-1 

= ^(ol-i 4). 

(3.45) 


and show that [S;, St] = <5y, the Hamiltonian is H = 
^Em=-i(l+^mV) and L 3 = 1 rnb]J) m . 


(3.4) Since for fermions we have that cjcjlO) = —CjcjlO), 
then the state 111) depends on which fermion is 
added first. If we put the first fermion into the 
state ipi(ri) and the second fermion into the state 
i/jj(r 2 ), a representation for 111) is 


'F(n,r 2 ) = —j= [ipi(ri)tlj 2 (r 2 ) - ip2(ri)i/) i(r 2 )]. 

V 2 

(3.46) 

Show that a generalization of this result for N 


yM 


4*(ri • • • 

rN ), which 

can be written out 

tpi(ri) 

1 P 2 (n) 

V’Ar(f'l) 


ipi(r 2 ) 

ip2(r 2 ) ■ 

ipN(r 2 ) 


ipi(r N ) 

ip2(r N ) ■ 

•• V’Jv(rjv) 



(3.47) 


This expression is known as a Slater determi¬ 
nant. 




Making second 
quantization work 



In this chapter we will bridge the gap between first and second quan¬ 
tization. This will result in new operators that act on states given in 
the occupation number representation. For this discussion we’ll limit 
ourselves to non-relativistic particles in a box. 1 By working in the non- 
relativistic limit, we are taking a shortcut and thereby avoiding some 
important complications. A fuller, relativistic treatment will be pre¬ 
sented in Chapter 11, but the approach taken in this chapter gives a 
route to some immediately useful results and in particular to the be¬ 
haviour of electrons in a solid. 


4.1 Field operators 

In the discussion so far we have introduced creation and annihilation 
operators a' p and a p which create and annihilate particles into particu¬ 
lar momentum states. So if we have the vacuum state |0) and we apply 
a], to it, then there is a big puff of smoke and a flash of light and the 
resulting state aj,|0) contains a single particle sitting in the momentum 
state p. Being a momentum eigenstate it is of course extended in space 
(but completely localized in momentum space). Now, by making appro¬ 
priate linear combinations of operators, specifically using Fourier sums, 
we can construct operators, called field operators, that create and an¬ 
nihilate particles, but this time they don’t create/annihilate particles in 
particular momentum states but instead they create/annihilate parti¬ 
cles localized at particular spatial locations. Thus the operator ip'{x) 
defined by 

= ( 4 -7) 

v p 

creates a particle at position x , while a) p creates a particle in a state 
with three-momentum p. Similarly, the operator ip(x) defined by 

^x) = -^'£a p e*-*, (4.8) 

V p 

annihilates a particle at position x, while a p annihilates a particle in a 
state with three-momentum p. In summary: 
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^We retain the ‘particle-in-a-box’ 
normalization of Chapter 3. States are 
defined in a box of volume V (using 
periodic boundary conditions). For 
later convenience, we list some useful 
results using this normalization: 

A state | a) can be described in posi¬ 
tion coordinates •?/;«(*) = (aj|a) or mo¬ 
mentum coordinates 'ip a (p) = (p|a:), 
and since 

(p\a) = J d 3 x (p\x) (x\a) , (4.1) 

then the Fourier transform 


«Mp) = j d 3 ie ipx ip a (x), 


implies that 

(4.2) 

(p\x) = ^~ ipx - 

(4.3) 

The inverse transform is 


V v p 

(4.4) 

since p takes discrete values. We also 
have 

j d 3 xe~ ipx = V5 P ,o 

(4.5) 

and 


l£e ip *=S(V(x). 

V 

(4.6) 
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The field operators and ^(x) respectively create or destroy a 

single particle at a point x. 

The following examples explore some properties of field operators. 


We use the result 

(y\p) = * y . 

which follows from eqn 4.3 that states 
that the projection of a momentum 
eigenstate in position space is a plane 
wave. The last equality in eqn 4.13 fol¬ 
lows directly from eqn 4.6. 


Example 4.1 

We examine the state |4/) = |0) and check that it does indeed correspond to a 

single particle at a particular location. Note that 

l*> = ^( X )\0) = -L^e-^-41 0 ). (4.9) 

V v p 

To calculate the total number of particles in this state we can use the number operator 
rip = dpdp (which measures the number of particles in state p) and then sum over 
all momentum states. Consider 

£**!*> = ■4E4 a 94i°> e_ip '". ( 4 - 10 ) 

q * k qp 

and again using (0|dgdp|0) = S pq we deduce that 

£n,|*> = |*>, (4.11) 

<? 

and the state 4/ is then an eigenstate of the number operator with eigenvalue 1. 

To find out exactly where the particle is created, first define | p) = dp|0) and then 
we can look at the projection of |4/) on another position eigenstate | y): 

(ym = (y\^(*m = ^=Ee _ipiC (3/IP> 

v V p 

= (4.12) 

P 

= 5 (3) (aj-y), 

showing that the single particle created by ( x ) may only be found at y = x. 


Example 4.2 

The field operators satisfy commutation relations. For example 

[</,(*), ^(y)] = i^e i (P“-''« ) [a p ,at], ( 4 . 13 ) 

pq 

and using [d p , dq] = 5 pq we have that 

[j,{ X ),tf{y)\ = lj2 ei(P ' X ~‘ VV)S P 1 = jE e?P ' ( *- ,,) = 5 W( X -y), ( 4 . 14 ) 

pq p 

where the last equality follows from eqn 4.6. By the same method you can show 

[^ f (a :),^ t (y)] = [v , (a=), - 0 (y)] = 0 . (4.15) 

These results are for boson operators, but we can derive analogous results for fermion 
operators: 

|^(a:),^ t (y)J = 6 (3> (x-y), 

|v) t (a:),^ t (y)| = {^(a;), V>(y)} = 0 . ( 4 . 16 ) 
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4.2 How to second quantize an operator 


How can we upgrade the operators that we had in first-quantized quan¬ 
tum mechanics into second-quantized operators? Let us first remind 
ourselves what an operator A does. It maps a quantum state | ip) (which 
lives in a Hilbert space 2 and describes the quantum mechanical state 
of a single particle) into another state A\i/j). We will focus in this section 
on A being a single-particle operator, an example of which would be 
p x = —iftV = — which operates on a single coordinate aq that 
can only describe the position of one particle. Thus we will ignore for 
now objects like the Coulomb potential operator V(x 1 ,x 2 ) = i7T \ x l- X2 \ 
which depends on the coordinates of two particles. An operator that 
acts on n particles is called an n-particle operator. 

The operator A can be described quite generally using a variety of 
coordinate systems, as can be seen if you insert two resolutions of the 
identity, 1 = l a )( a l an< ^ = L /3 \P)(P\ into the right-hand side of 
the trivial identity A = A: 

A = J2 |a)(a|A|/3)(/3| = ]T A a p\a){(3\, (4.17) 

a(3 ot/3 


where the matrix element A a /3 = {a\A\(3). This is all well and good, but 
quantum field theory is acted out on a larger stage than Hilbert space. 
We need a space that includes not only single-particle states, but the 
possibility of many-particle states. This is known as Fock space. 3 

Remarkably, the second-quantized many-body upgrade A of the 
single-particle operator A to Fock space is a very compact expression: 


A ^ ) Aa/3 a) 

a/3 


a/3- 


(4.18) 


This equation has the same single-particle matrix element A a /3 = 
(a | .41/3) that we had in eqn 4.17, even though it is now valid for the 
many-particle case. The interpretation of eqn 4.18 is beautifully simple. 
The operator A is a sum over all processes in which you use ap to remove 
a single particle in state 1/3), multiply it by the matrix element A a p, and 
then use a)* to place that particle into a final state |a). This operator A 
thus describes all possible single-particle processes that can occur and 
operates on a many-particle state (see Fig. 4.1). The proof of eqn 4.18 
is rather technical and is covered in the following example, which can 
be skipped if you are happy to take the result on trust. 


Example 4.3 

To prove eqn 4.18, we need to introduce some formalism. Let us first write down an 
expression for an TV-particle state 

1 N 

IVn, ■ • • = -r= II IWyi)>- (4.19) 

v-W- p i=1 


2 A Hilbert space is a complex vector 
space endowed with an inner product, 
whose elements look like an\4>n) 
where a n are complex numbers. It can 
thus describe particular superpositions 
of states such as, just for an example, 
(\<f>o) + i|0i))/v / 2 in which the system 
might equally be found in state |</>o) or 
\(pl), but the quantum amplitudes for 
being in each state are n/2 out of phase. 
This is a superposition of possibilities, 
but it only describes a single particle. 

Q 

Fock space F includes Af-particle 
states for all possible values of N. 
Mathematically, one can write this as 

oo 

F = Fn = Fo 0 Fi 0 • • • , 

N =0 

where Fo = {10)} is a set contain¬ 
ing only the vacuum state and F\ is 
the single-particle Hilbert space. The 
subspaces Fn> 2 > which describe N- 
particle states with N > 2, have to 
be symmetrized for bosons and anti¬ 
symmetrized for fermions (so, for ex¬ 
ample, F<z is not simply F\ (8> F \, but 
only the symmetric or antisymmetric 
part of F\ (g> F\). Creation operators 
move an element of Fjy into an ele¬ 
ment of Fn- |-i, while annihilation op¬ 
erators perform the reverse manoeuvre. 
An element of F can describe a super¬ 
position of states with different particle 
numbers. 





Fig. 4.1 A process represented by the 
operator in eqn 4.18. 
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5 5_0_J_2_JLjL 
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v 2 2_3_4_ 5_0_J_ 

1 J_2_3_4_5_0_ 

0 |o 11 |2 |3 |4 Is 
0 1 2 3 4 5 
P 

Fig. 4.2 For N = 3, a sum over per¬ 
mutations P and Q is equivalent to 
N\ copies of a sum over permutations 
P' = P + Q. 

1 ’ll 1 e right-hand side of eqn 4.21 can 
be written det((xi \<Pj)) for fermions or 
per((xi\ipj)) for bosons. The determi¬ 
nant of a matrix Aij is 

N 

det(A) = X^(-1) P ]4 AiP(i)- 
P i=l 

The permanent of a matrix A , :l is 
N 

per(A) = En 
P i=l 



Fig. 4.3 The action of a creation oper¬ 
ator on a state |?/q,... ,iPn)- 


This is a sum over all N\ permutations of single-particle states I'lpi). There are N\ 
permutations of the sequence of integers i = 1,..., N and P(i) labels a permutation 
of this sequence. The factor £ = 1 for bosons and £ = — 1 for fermions. For fermions, 
the notation £ p gives a plus or minus sign for an even or odd permutation respectively 
(and for bosons, of course, the factor can be ignored as it is always +1). The inner 
product of two such iV-particles states is 

1 N 

(xi>• • • >Xivl^i>• • • »^jv> = ( 4 - 20 ) 

' P Q i= 1 

We can rewrite this using P' = P + Q which spans all the permutations N\ times, 
see Fig. 4.2, and hence 

N 

<Xi, • • -,XN IV’i, • • -An) = e« p n (XilV’P'w)- (4.21) 

P' i =1 

If desired, this rather complicated expression can be written rather compactly as a 
determinant (for fermions, or a ‘permanent’ for bosons). 4 

Let’s now try using a creation operator to act on one of these 7V-particle states: 

nJ.IV’i, • • -An) = ■ -An)- (4.22) 

Thus the cuckoo lays its \4>) egg in the |?/q,..., i/jn) nest, see Fig. 4.3. Annihilation 
is somewhat harder to formulate, mainly because you have to cater for the possibility 
that the state you are annihilating is some admixture of states already present. To 
proceed, it is helpful to realize that a$ is the Hermitian conjugate of a^. Thus 

(xi> • • ■ > XN-i\a^\tpi, • • -, i/)n) = (' ! / , i> • • • > V’ivl^Jjxi; • • • > Xjv— l}* (4.23) 

= {ipi, • • •, iI’n\<P> Xi) ■ • • j Xjv-i)* 

W>i|</>} Wrlxi) ••• (V’iIxjv-i) 

(V’jv l</>) (V’ivlxi) ••• (V’jvIxat-i) ? 

This last line has been obtained by recognizing that the expression to evaluate is a 
determinant (fermions) or permanent (bosons). It can be expanded as a series of 
minors: 

(xi> • ■ • iXN-ila^ipi,..., tp N ) 

N 

= E ■■ (no ip k ). ..ip N \xi,---,XN-l)* 

fci (4.24) 

N 

= > XN-l\fl,- ■ • (no ip k )... i/j n ). 

k =1 

This is true for any premultiplying (xi, • • • ,XN— i| 5 and so 

N 

a<f>\ipi,. ■ •, 'iPn) = ^2 ■ • (no 'ipk )... 'ipjsr). (4.25) 

k =l 

This means that a combination of annihilation of | (3) and creation of |a) gives 

N 

• •. ,^jv) = • • • (no ip k )... Vw), (4.26) 

k =l 

but now we can simply translate the inserted state |a) into the position formerly held 
by the annihilated state \^k) and thereby remove the annoying 1 factor, since 

£ fc ~ 1 K'0i, • • ■ (no 4> k ). ..ip! v) = \ipi,. ■ .ip k -i,a, ip k+1 ,. ..iPn) (4.27) 

and hence 

N 

“a“/3 I’/’l, ■ • • ,4>n) = E ■ ■ ■ ’•Pk-I,a, ipk+ 1 , • • • V'Jv)- 

k=l 


(4.28) 
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After all this effort, we are now ready to tackle the real problem. Given a state 
..., iPn) and a single-particle operator A = A a p\on)((3\, the obvious second- 

quantized upgrade is to let A act separately on each individual component state 
inside |-0i,... , ^>jy) an( l then sum the result. For example, if A is the single-particle 
momentum operator, we would want A to be the operator for the total momentum, 
the sum of all the momenta of individual particles within the many-body state. 

Let’s make it very simple to start with and set A = \a)(/3\. In this case, for a 
single-particle state \i/jj) we would have A\ifij) = (f3\'ijjj)\a) and hence 

N 

A\^1 •••Vw) = y>h/h-)hh,.. ^fc+i, • • • ^jy)- (4.29) 

3 = 1 

But this is simply ... ,^iv) as we found in eqn 4.28. Hence A = a^ap. In 

the general case of A = ^2 a p A a p |a)(/?|, we easily then arrive at 

A = ^ Aap o3 a ap. (4.30) 

a(3 


After the hard work of the previous example, which justified eqn 4.18, 
we now have the power at our fingertips to do some splendidly elegant 
things. 


Example 4.4 

(i) The resolution of the identity 1 = o:) (o is a special case of A = 

Ea/3 •A a f}\ci){P\ with A a f 3 = S a p. Its second-quantized upgrade is immediately 

n = y~] ajda, (4.31) 

OC 

the number operator, that counts one for every particle in the state on which it 
operates. 

(ii) The usual momentum operator can be written A = p = ^2 p p\p)(p\ and this 
upgrades immediately to 

P = ^pa^cij, = ^pfip. (4.32) 

P V 


In much the same way, a function of the momentum operator A = f{p) becomes 

A = ^2f(p)ala p = ^2f(p)n p . (4.33) 


One particular example of this is the free-particle Hamiltonian which one can write 
down immediately in second-quantized form as 

" 2m (4 ' 34) 

P 

Equation 4.34 can be thought about as the simple diagram shown in Fig. 4.4. Since 
the occupation number states (with which we want to work) are eigenstates of n p then 
so is H. We have found that the operator H is diagonal in the basis in which we’re 
working. In fact, to diagonalize any Hamiltonian (which means to find the energies 
of the eigenstates) we simply express it in terms of number operators. Having the 
Hamiltonian in this form makes perfect sense. We’re saying that the total energy in 
the system is given by the sum of the energies p 2 /2m of all of the particles. 

(iii) What happens if our operator is a function of *, rather than p? Well, eqn 4.18 is 
valid even if our states |ai) and |/3) are position states. Our creation and annihilation 
operators are simply field operators, and the sum over momentum values becomes 
an integral over space. We then can write down our second-quantized operator V as 



d 3 x ^ (x)V(x) ip(x). 



Fig. 4.4 A process represented by the 
„ 2 

Hamiltonian H = 2 ^™P" Sum¬ 

ming over all momentum states involves 
repeating this counting process which 
can be imagined as counting sheep 
crossing through a gate (annihilate a 
sheep on the left-hand side of the gate, 
count its energy in the sum, create the 
sheep again on the right-hand side of 
the gate, now move on to the next sheep 
...). 


V 


(4.35) 
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If we want, we can express this back in momentum space using eqns 4.7 and 4.8, and 
hence 

v= £ fd 3 x <ip l e~ iPl x V(x) a P2 e iP2 ' x = £ V Pl - P2 d\, i a P2 , (4.36) 

PlP2 PlP2 

where V p = y f d 3 xV(x)e~ ip x is the Fourier transform of V(x). The operator 
V isn’t diagonal, since the a operators create and annihilate particles with different 
momenta. We can think of this term as representing a process of an incoming particle 
with momentum p 2 which interacts with a potential field (represented by V Pl — P2 ) 
and then leaves again with momentum p± (see Fig. 4.5). 


Fig. 4.5 A process represented by 
eqn 4.36. The sequence of events is: an¬ 
nihilate a particle with momentum p 2 , 
count the potential energy V Pl — P2 > cre¬ 
ate a particle with momentum p 1 . 


Example 4.5 

Let’s have a look at the influence of the potential on a simple system described by a 
Hamiltonian 

H = Eo Y, d t d P - \ E d k d P2 ■ ( 4 - 37 ) 

p P 1 P 2 

We’ll further constrain the system to have three energy levels, so states are expressed 
using a basis \n pl n P2 n P3 ). For simplicity, let’s first set V = 0. In this case, if we 
put a particle into the system then the three possible eigenstates of the system are 
1100), 1010), 1001). Now we put V / 0, and so 


V 


- dj^dpallOO) = — (|100) + |010) + |001» , 


(4.38) 


PlP2 

and similarly for the other states (try them). 

At this point it’s easiest to slip into matrix notation, for which the entire Hamil¬ 
tonian takes the form 


H = 


Eo 


V 

~2 


(4.39) 


The eigenvalues for this equation are £ = E 0 , Eo , Eo - 2 ”. The ground state of the 

system (called |f2)) therefore has an energy Eo — The corresponding eigenstate 
is 

|010) + |001». (4.40) 


1 


|Q) = —(|100)- 

We conclude that the ground state is a special 1:1:1 superposition of the three states 
we started with. 


Example 4.6 


What is the equivalent of the probability density of the wave function in second- 
quantized language? The answer is that the number density of particles at a point 
x is described by the number density operator p(x) given by 

p(x) = (x)ifi(x) 

= y £ [e-fri-**>-] at i<W (4.41) 

V PlP2 


This operator enables us to write the potential energy operator for a single particle 
in an external potential as 


V = J d 3 xV(x)p(x). 


(4.42) 
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4.3 The kinetic energy and the 
tight-binding Hamiltonian 

One application of this formalism is to the tight-binding Hamiltonian 
which is used in condensed matter physics. The basic idea behind this 
model is very straightforward: electrons don’t like being confined. We 
know this from the well-known problem of a one-dimensional infinite 
square well with width L. There the kinetic part of the energy is given 
by E n = ( 17 ) ■ The smaller we make L, the larger the energy. 

Conversely, kinetic energy can be saved by allowing the electron to move 
in a larger volume. In the tight-binding model, one considers a lattice of 
fixed atoms with electrons moving between them (as shown in Fig. 4.6). 
These electrons can lower their kinetic energies by hopping from lattice 
site to lattice site. To deal with the discrete lattice in the model, we 
need to work in a basis where the fermion creation operator c\ creates 
a particle at a particular lattice site labelled by i. The kinetic energy 
saving for a particle to hop between points j and i is called tij . Clearly 
tij will have some fundamental dependence on the overlap of atomic 
wave functions. The Hamiltonian H contains a sum over all processes 
in which an electron hops between sites, and so is a sum over pairs of 
sites: 

£ = ( 4 - 43 ) 

ij 

Each term in this sum consists of processes in which we annihilate a 
particle at site j and create it again at site i (thus modelling a hopping 
process), thus saving energy tij. For now, we’ll consider the simplest 
possible case of t l3 = t for nearest neighbours and t = 0 otherwise. Thus 
we rewrite H as 

H = -t^2c\c i+T , (4.44) 

IT 

where the sum over r counts over nearest neighbours. 


Fig. 4.6 The operator c[dj causing an 
electron to hop from site j to i. Such a 
process saves kinetic energy tij. 


Example 4.7 

The tight-binding Hamiltonian is not diagonal since we have a bilinear combination 
of operators that look like c\c-j. We need to diagonalize it and that can be achieved 
by making a change of basis (again, essentially using a Fourier transform). Thus 

and £ i = 4E ri,T ‘4- ( 4 - 45 ) 

V k V q 

Substituting this into the Hamiltonian yields 

H = -^J2J2 e ~ Kq ~ k) ' ri ~ iqrT 4^- (4.46) 

ir kq 

We play our usual trick and exploit the fact that ^ e~ 1 ( g ~ fc )' r * = 5 q k, which 

removes the sum over i. Lastly we do the sum over q. After jumping through these 
familiar mathematical hoops we end up with 


H~-tj2J2 eikTT£ l d >°’ ( 4 - 47 ) 

k T 

which is diagonal in momentum. 
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We can tidy up this result by writing the Hamiltonian as 



Fig. 4.7 The tight-binding model for 
a two-dimensional square lattice. Con¬ 
tours showing lines of constant energy 
Ek based on eqn 4.50 are plotted as a 
function of k x and k y . The lowest E & 
occurs at (k x ,k y ) = (0, 0), the centre of 
this plot, for t > 0 and at the corners, 

(k x ,ky) = (±f,±f) for t < 0. 


H = Y, E kc{c k , (4.48) 

k 

where the dispersion E & is given by 

E k = -J2^ ikrr - (4.49) 

r 


Example 4.8 

For a two-dimensional square lattice, the sum over r counts over vectors (a, 0), 
(—a,0), (0, a) and (0, — a). This leads to a dispersion relation (see also Fig. 4.7) 

E*. = — 2£(cos k x a + cos k y a). (4.50) 


4.4 Two particles 


We have been focussing on single-particle operators. What happens for 
the more interesting case of two particles interacting with each other? 
From the discussion above we can guess (quite correctly) that a second- 
quantized two-particle operator A is given by 

A= ^ Aap-ysaia^a^as, (4.51) 

cx/3 / yS 


where A a 0-yS = (a,/3|7, 6} is the matrix element for the Hilbert-space 
version. This expression embodies the idea that two-particle operators 
involve a sum over all processes involving the annihilation of two parti¬ 
cles in particular states, multiplication by the matrix element, and cre¬ 
ation of two particles in two new states. We will frequently encounter 
two-particle operators V written as a function of spatial coordinates, 
and the corresponding expression involving field operators is 

V = ^ j d 3 xd 3 yft(x)ft(y)V(x,y)i>(y)-ii)(x). (4.52) 

The factor of \ ensures that we don’t double count the interactions. 
Notice the way we have arranged the operators: we put the creation 
operators to the left and the annihilation operators to the right, a con¬ 
vention which is known as normal ordering. Why is this important? 
Normal ordering makes the evaluation of matrix elements much easier 
and in particular it makes sure that our operator, V, has zero vacuum 
expectation value: 


(0|V|0)=0. 


(4.53) 
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This avoids us to having to blush over embarrassing infinite self-energy 
terms that can otherwise creep into our answers. With creation oper¬ 
ators to the left and annihilation operators to the right we make sure 
that (0| W|0) is fixed at zero. 

An additional convention we have adopted in eqn 4.52 is that read¬ 
ing the coordinates from right-to-left gives us x,y,y,x. (This can be 
remembered as the dance step: left-in, right-in, right-out, left-out.) For 
bosons the order would not matter, but fermions anticommute and so for 
example interchanging the first two creation operators would introduce 
a minus sign. 

It is useful to see what would happen if we hadn’t introduced normal 
ordering. For example, an alternative guess for the form of the two- 
particle interaction might have been 

Kvrong = ^ / d 3 xd 3 y V (x , y)p(y)p(x) . (4.54) 

This form is wrong and, because we’ve left out the all-important normal 
ordering, will lead to an extra self-energy term, as shown in the following 
example. 


Example 4.9 

We will consider the case of fermions and will need the anticommutation relations 
x),ipHy)^ = 8( 3 l(x — y) and V>(y)| = 0. Now consider the operator 

p(x)p(y), which is given by 

p(x)p(y) = (x)ip(x)ipi (y)4>(y). (4.55) 

Notice that this is certainly not normal ordered, but we can make it so as follows. 
Swap the second and third operators. This involves using an anticommutator. We 
obtain 

p(x)p(y) = -fl(x)fl(y)ii>(x)ip(y) + S (3) (x - y)fl (x)ip(y) 

= ^ (x)^ (y)'ip(y)'ij)(x) + S (3, (x - y)ifi^(x)'ip(y), (4.56) 

where, in the last line, we’ve made another swap (this time of the two left-most 
operators, involving a second sign change). We are left with a normal ordered term 
plus a term involving a delta function. Comparing this to the correct expression for 
the two-particle interaction, we have 

Vwrong = V + - J d 3 xV(x, x)^ (x)^{x). (4.57) 

The ‘wrong’ form of the potential energy includes an unwanted self-energy term. 
Normal ordering ensures we dispose of this abomination. 



As before, we continue the procedure of second quantization by substi¬ 
tuting the mode expansion into our equation 

v = * J d 3 xd 3 yft(x)ft(y)V(x,y)’il>(y)-ii)(x) 

= 2^2 / d 3 £ d 3 y e i (- pi ' x ~ P2 " y+P3 " y+P4 ' x ^ £ 0 ^^* - y^Op,. 

PlP2P3P4 


(4.58) 
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Fig. 4.8 A Feynman diagram for the 
process described in the text. Inter¬ 
preting this diagram, one can think of 
time increasing in the vertical direction. 


^As we will see, not only does this 
tell us about seemingly pedestrian mat¬ 
ters concerning the energies of elec¬ 
trons in metals, but it also describes 
interactions that lead to superfluidity, 
superconductivity, magnetism, charge 
density waves and pretty much every 
other phenomenon you’ve ever encoun¬ 
tered in condensed matter physics. Of 
course, the fact that eqn 4.63 cannot 
generally be diagonalized should give 
us pause. We need a programme to 
turn the two-body potential diagonal. 
As we will see in Part X, we do this 
in two stages: firstly we massage our 
Hamiltonian until it’s made up of bi¬ 
linear combinations of operators. We 
may then make transformations until 
we have number operators. 

6 The Hubbard model is named after 
John Hubbard (1931-1980). 


Example 4.10 

To deal with this, we define z = x — y and eliminate x to obtain 

v = 4l4 2 a P3«P4 [ f d 3 J/ e'( — Pl —P2+P3+P4)'» 

P 1 P 2 P 3 P 4 

(4.59) 

The last integral gives us a Kronecker delta VJ Pl +p 2 — p 3 ,p 4 5 which can be used to 
eat up one of the momentum sums (the p 4 one here) and sets p 4 = p 1 + p 2 — P 3 - 
Now we have 

T 4i a p 2 “P3 a Pi+P2-P3 f d3 zl , W e " l(P3 " 1 ’ a) ' z . (4.60) 

v P 1 P 2 P 3 J 

The last integral is the Fourier transform of the potential 

1 j d 3 Z V'( Z )e- i (P3~P2)- = Vp 3 - P2 . (4.61) 

Now for some interpretation: set q = p 3 — p 2 an d eliminate p 3 , yielding 

F = - Vqdp 1 dp 2 dp 2 +qdp 1 - q - (4.62) 

PlP2<l 

Lastly we reindex the sum subtracting q from p 2 and adding it to Pi . 


The result of these index acrobatics is a two-particle interaction potential 
that looks like 

^ = 2 E ^ a P 1 +«4 2 -« a p a a p 1 - ( 4 - 63 ) 

P1P2Q 

This can be interpreted as a scattering problem in momentum space. 
This scattering is illustrated in a cartoon known as a Feynman diagram. 
The diagram in Fig. 4.8 is translated into a story as follows: A particle 
comes in with momentum p 2 . It sends out a force-carrying particle 
with momentum q , reducing its final momentum to p 2 — q. The force¬ 
carrying particle is absorbed by a second particle (which originally has 
momentum jq) and ends up with a final momentum of p 1 + q. Notice 
we conserve momentum at the vertices of the diagrams. 

Our aim is going to be to turn all of our combinations of operators into 
number operators. This makes evaluating the energy a simple matter. 
The first thing to notice about eqn 4.63 is that it is not diagonal. In 
general, problems involving the potential energy (as in eqn 4.63) cannot 
be solved exactly. Despite this, the two-particle potential leads to a 
remarkably rich seam of physics. 


4.5 The Hubbard model 

As this chapter’s final example of a second-quantized Hamiltonian, we 
turn to the Hubbard model. 6 It is difficult to overemphasize the im¬ 
portance of this model in solid state physics, due to the fact that it 
captures the essential physics of the competition between kinetic energy 
(favouring electrons to become delocalized) and potential energy (favour¬ 
ing electrons to become localized). Though relatively easy to state, the 
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Hubbard model is surprisingly complicated to solve and exact results are 
only available in certain scenarios. The Hubbard model is based on a 
tight-binding Hamiltonian (to model the kinetic energy) plus a two-body 
potential energy term (to model electron-electron interactions): 

H = Y^{-tij)c\cj + 4 c)v ijk ic k ci. (4.64) 

ij ijkl 

Recognizing that electrons carry a spin a which can point up (| f)) or 
down (II)) this becomes 

H = + 2 CicrCja'VijklCka'Clcr. (4.65) 

ij<7 ijklaa' 

We assume in this model that no spin-flip processes can occur; thus 
spins never flip from up-to-down or vice versa on hopping. However, the 
interaction between electrons is based upon the Coulomb interaction 
between their charges and so electrons with opposite spin will interact 
with each other just as much as electrons with the same spins. Now for 
Hubbard’s model: the Coulomb interaction is assumed to be significant 
only between two electrons that are on the same site. These electrons 
interact via a constant potential energy U = Van . However, the Pauli 
exclusion principle ensures that should two spins hop into the same 
site they must have different spins, and this is where spin does make a 
difference. The Hamiltonian for the Hubbard model is then 

H = + uY^fii-fhn, (4.66) 

ijcr i 

which looks simple, but the eigenstates tend to be complex and highly 
correlated. It is often only necessary to consider hopping elements for 
nearest neighbours. 


Example 4.11 

Consider the Hubbard model with nearest-neighbour hopping only, and to make 
things as simple as possible, consider a lattice of only two sites. If, in addition, we 
only have one, spin-up electron, then that single electron can either be on the first 
site (a configuration we will denote as | 1,0), meaning ‘spin-up electron on site 1, 
nothing on site 2’) or on the second site (a configuration we will denote as |0, T))- 
Hence a general state can be written as the superposition \^) = a\ 1,0) + 6|0, f), 
where a and b are complex numbers. In this case, there’s no possibility of potential 
terms (because there is only one electron) and in this basis the Hubbard Hamiltonian 
can be written 

^=(-° t 0*)- ( 4 - 67 ) 
Solving gives us a ground state \ijj) = ~^= (| 1,0) + 10, f)) with energy E = —t and an 
excited state \tp) = ~^= (| t,0) — 10, f)) with energy E = t. 

Now we’ll have two electrons in the system. If they have the same spin the answer’s 
simple and E = 0 (try it). If the electrons have different spins we write a general 
state as |i/>) = a\ fj,,0) + b\ f,J,) + c| J,, T) + ^|0»Ti) an d in this basis the Hubbard 
Hamiltonian is 



u 

-t 

-1 

0 

\ 


-t 

0 

0 

-t 



-t 

0 

0 

-t 


V 

0 

—t 

-t 

u 

) 


H = 


(4.68) 
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This can be diagonalized to give a ground state energy of E = E — 4 (U 2 + 16f 2 ) 2 
which corresponds to a wave function | ip) = IV (I Tb 0} + IF| T, J.) + W| J., T) + |0, TJ.)) 
with N some normalization constant and IT = (, -[ ), (U 2 + 16t 2 ) 2 . 


In this chapter we have considered models on discrete lattices which are 
relevant to condensed matter physics. In the following chapter, we will 
consider how to generalize this approach to the continuum. However, 
readers who are eager to see some uses of the machinery developed in 
this chapter should now be able to tackle the first parts of the chapters 
in Part X, which deal with some applications from condensed matter 
physics. 


Chapter summary 

• Field operators ’ijA and ^ are the analogues of creation and anni¬ 
hilation operators in position space. 

• Second quantization of models can be carried out by writing down 
operators as products of creation and annihilation operators that 
encode the physical particle processes. We write these products 
in normal-ordered form (meaning the field creation operators are 
written to the left and the field annihilation operators to the right). 

• The vacuum expectation of normal-ordered products is always zero. 

• Examples of this technique have been given for the tight-binding 
model and the Hubbard model. 


Exercises 

(4.1) We can define a generalized commutator [A. B ]<; as 

[A,B] C = AB-(BA, (4.69) 

so that £ = 1 yields [A, = [A, B] for bosons 

and £ = — 1 yields [A, B]^ = {A, B} for fermions. 
The generalized commutation relations can then be 
written 

h/’(*),'0 t (y)k = <5 (3) (* - y), (4.70) 

and 

[i>(x),i>{y)] c = o. (4.7i) 

Repeat the argument given in Example 4.9 using 
the generalized commutator and show that Wrong 
yields the same result for both bosons and fermions. 


(4.2) One can define a single-particle density matrix 

as 

Pi(x - y) = (4.72) 

By substituting the expansion of ijy{x) and ^(y) in 
terms of a\ and a q into this expression, show that 

Pi(*-y) = i^ e “ l( ’ J '*“ P!/) (4 a p>- (4-73) 

pq 

(4.3) Evaluate the eigenvalues and eigenvectors of the 
Hubbard Hamiltonian given in eqn 4.68 in Exam¬ 
ple 4.11. 

(a) What are the energy eigenvalues in the limit 
t = 0? 

(b) How do these energy levels change as t 0 in 
the limit t/U <C 1? 




Part 


Writing down Lagrangians 

An important step in writing down a physical model is to construct the 
appropriate Lagrangian. This part of the book is concerned with how 
to do that and is structured as follows: 

• In Chapter 5 we describe how the arguments in the previous part 
which worked on discrete systems can be generalized to the con¬ 
tinuum limit. After reviewing Hamilton’s formulation of classical 
mechanics and Poisson brackets, we shift our attention from the 
Lagrangian L to the Lagrangian density C. We use the electromag¬ 
netic field as a first example of this approach. 

• We have our first stab at constructing a relativistic quantum wave 
equation, the Klein-Gordon equation in Chapter 6. This turns 
out to have some unsavoury characteristics that means that it is 
not the right equation to describe electrons, but it is nevertheless 
illuminating and illustrates some of the issues we are going to come 
across later. 

• In Chapter 7 we present a set of example Lagrangians and show 
how they work. After reading this chapter the reader should have 
a good working knowledge of the simplest examples of quantum 
field theories. 
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In the last few chapters we have seen examples of how various systems 
can be reduced to a set of simple harmonic oscillators, each one describ¬ 
ing a different normal mode of the system. Each normal mode may be 
thought of as a momentum state for non-interacting, identical particles, 
with the number of particles corresponding to the number of quantized 
excitations of that mode. We have defined operators that have created 
or annihilated these particles. In the previous chapter, we particularly 
focussed on discrete models (tight-binding, Hubbard, etc.) in which par¬ 
ticles moved around on a lattice. In this chapter we want to extend this 
notion to the continuum limit, that is when the discretization of the 
lattice can be ignored. This will lead to the concept of a classical field. 
Before we scale those dizzy heights, there is some unfinished business 
from classical mechanics to deal with, thereby introducing some useful 
formalism. 


5.1 Lagrangians and Hamiltonians 


In this section, we return to the Lagrangian formulation of classical 
mechanics and introduce the Hamiltonian and the Poisson bracket. The 
Hamiltonian will give us an expression for the energy which appears as 
a conserved quantity if the Lagrangian doesn’t change with time. The 
Poisson bracket is also telling us about how quantities are conserved. 
Let’s begin by considering the Lagrangian L(qi, c/i) for a system described 
by coordinates g,;. The rate of change of L is given by 


dL 
d t 


dL dL 

Vj— Qi + ww®, 
GQi OQi 


(5.1) 


where the summation convention is assumed. Using the Euler- Lagrange 
equation we can turn this into 


dL 

d 

(dL^ 

\ . dL.. 

d 

(dL . \ 

d t 

“ d t 1 

\dqij 

<U + w?i = 
1 dqi 

dt 1 



(5.2) 


We define the canonical momentum by 



and then eqn 5.2 can be written 


(5.3) 


d_ 

df 


(Pi<li - L) = 0. 


(5.4) 
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This is clearly a conservation law, and we write it as 


thereby defining the Hamiltonian H by 

H = PiQi - L. (5.6) 

We will see later that the Hamiltonian corresponds to the energy of the 
system. Since H is conserved it is instructive to look at variations in H. 

ThUS f)L dL 

SH = piSqi + Spiqi - —— Sqi - w^Sqi. (5.7) 

OQi <JQi 

The first and fourth terms in this expression cancel because of eqn 5.3, 
and so 

SH = Spiqt - —— Sqi. (5.8) 

oqi 

Since H is a function of qt and pi, we expect that 

fATT fATT 

6H = j- Sqi + Z-Spt, (5.9) 

dq t dpi 

and so matching terms in eqns 5.8 and 5.9 produces qt = dH/dpi and 

dH/dqi = —dL/dqi = —Pi, and these two equalities are known as Note that because pi = (dL/dcg) then 
Hamilton’s equations: Pi = d/dt(dL/dq,) = dL/dqi , where 

the last equality follows from the Euler- 

OH dH Lagrange equation. 

= Qi, it- = ~Pi- ( 5 . 10 ) 


Hamilton’s equations represent another way to find the equations of 
motion of a system. 

We now turn to the other piece of nineteenth century classical mechan¬ 
ics which we want to review. We define the Poisson bracket {A, H} PB 
by 

r , dAdB 8AdB 

{ ’ }pB _ d qi dp % dpi d qi ' ( } 

Now consider any function F which depends on the coordinates qi and 

Pi . The rate of change of F is given by 

d F dF dF. dF . dF rrirrl . 1 . 

df ~ ~di + % 1+ Wi Pi " ^ + { ’ >PB ’ ( } 

where the last equality is achieved using Hamilton’s equations. Thus if 
the field is not itself a function of time, we have 


Simeon Denis Poisson (1781-1840) 


= {F,H}, 


so that if {F,H} pB = 0 then F is a constant of the motion. We have 
therefore found a link between a conservation law (F is a conserved 
quantity) with the Poisson bracket of the Hamiltonian and F being zero. 
This is highly evocative of the commutator in quantum mechanics, but 
note that this result is entirely classical! 
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Fig. 5.1 The trajectory of the sys¬ 
tem moves on the intersection of sur¬ 
faces of constant H and F. This 
is because (qr, p) is perpendicular to 
both VF = (If, If) (eqn 5.14) and 

™ = (If. If) (eqn 5.15). 


Example 5.1 

There is an interesting graphical interpretation of this result of a conserved F (see 
Fig. 5.1). If F = F(q,p ), then the vector (q,p) = (dH/dp,—dH/dq) is tangent to 
the surface F(q,p) = constant, because 


VF 


,'dF 8F 
(<?-P) = 770 TT- 


, 8F OH dF dH 

(<FP) = q — t. - T— = { F , h }pb = °. ( 514 ) 

OQi dpi Uqi 


, dq 8p 

and (q, p) is also tangent to the surface H(q, p) = constant, because 


VH(q.p) 


fdH 8H\ 
\ dq dp ) 


■ (q.p) = -P Q + Q ■ P = 0 , 


(5.15) 


and so the trajectory of the system moves on the intersection of these two surfaces. 


In quantum mechanics, the rate of change of the expectation value of an 
operator F is given by 


d (F) 

d t 


^\F,HX h 


(5.16) 


and since this looks very much like the classical version 


dF 

d< 


{F,H } PB , 


(5.17) 


we are led to visualize the classical to quantum crossover as involving 
the replacement 

{F,H} pb ^^([F,H]), (5.18) 

and in fact for any Poisson bracket 

{A,B} pb ^ ^([A,B]}. (5.19) 


Example 5.2 

The Poisson bracket between position and momentum coordinates is 


{QjiPk} pb — 


Using eqn 5.19 we have [qj,Pk\ = i fobjki 


d<lj_ &Pk_ _ dPj dq k 
dqi dpi dqi dpi 
^ij^ik 0x0 

Sjk- (5.20) 

familiar from quantum mechanics. 


5.2 A charged particle in an 
electromagnetic field 

To practise some of the ideas in the previous section, we will consider a 
very simple example: a free particle of mass in. Shortly we will give it a 
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charge q and turn on an electromagnetic field, but for now we will keep 
all electromagnetic fields switched off. We want to include relativity, 
and so therefore the action S must be a Lorentz-invariant scalar. We 
can write S = f T 2 Ljdr where r is the proper time, and so 7 L must 
be a Lorentz invariant. We can thus write L = constant /j and we will 
choose 


L = — 


7 


(5.21) 


where the constant me 2 is chosen so that L has the correct low-velocity 
limit . 1 The action S can now be written 


S = 


—mc 2 dr = 


ds, 


(5.22) 


where the interval 2 ds = c 2 dt 2 — dx 2 — dy 2 — dz 2 . By the principle of 
least action SS = 0 and so S ds = 0. The integral J^ ds takes its max¬ 
imum value (see Fig. 5.2) along a straight world-line (see Exercise 5.5) 
and so this implies reassuringly that free particles move along straight 
lines. 


Example 5.3 

Find expressions for the momentum and energy of a free particle. 

dL _ d_ 
dv dv 
„2 


Answer: 


p = = — f — mc 2 (l — u 2 /c 2 ) 1 / 2 ) = jmv, 


E = H = p ■ v — L = 7 rmw 


7 


= 7 me 


“5T + ( 1-o 


= 7 me . 


(5.23) 

(5.24) 


In special relativity, we can assemble the energy and momentum into 
a momentum four-vector p^ = (^,p) [or equivalently p M p)]. 

To make our free particle more interesting, let’s give it a charge q and 
couple it to an electromagnetic field. The electromagnetic field can be 
described by a four-vector field = (TM ; yl(x)) where V(x) is the 

scalar potential and A{x) is the magnetic vector potential. The action 
becomes 3 


S = 


—me ds — qA^ dx M 



—me 2 


7 


+ qA ■ v 



d t. 


(5.25) 


The Lagrangian is the integrand of this equation, and so the canonical 
momentum in this case becomes 


dL 

p = —— = 'ymv + qA. 
ov 


(5.26) 


Now setting c = 1, the familiar E and B fields can be obtained from 
the four-vector potential A^ = ( V ., A) using the equations 


1 See Exercise 5.4. 


o 

The interval ds is an invariant, and 
so takes the same form in all inertial 
frames. In the rest frame of the parti¬ 
cle ds = edr and so this is true in all 
inertial frames. 

t 


B 



A 


1 -► x 

Fig. 5.2 The straight-line path from 
A to B has a larger f a ds than the wig- 
gly one (because cdt < \/c 2 d t 2 — dx 2 ), 
even though it looks longer. 


q 

The potential energy of the electro¬ 
magnetic field interaction is linear in 
the field A^ and takes the Lorentz- 
invariant form — qA^dx^. Note 
that A^ = and dx^ = 

(cdt, da?) = (c, v)dt and so —qA^dx^ = 
-q(V — A • v)dt. 
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Note that: 



A M = (V,A), 
A M = (V,-A), 


Also, keep in mind that 

E = (E 1 , E 2 , E 3 ), 

and 

B= (B 1 , B 2 , B 3 ), 
are ordinary three-vectors. 


but to put this in a relativistically covariant form we define the second- 
rank antisymmetric tensor F jlu by 

F lut = d ll A v -d v A tl . (5.28) 


This is known as the electromagnetic field tensor and looks like a 
four-dimensional version of a curl. In components, this yields 


v — 

[IV — 


Similarly, the tensor 


F> iV 


0 

E 1 

E 2 

E 3 

- E 1 

0 

—B 3 

B 2 

- E 2 

B 3 

0 

—B 

- E 3 

~B 2 

B 1 

0 




0 

-E 1 

-E 2 

- E 3 

E 1 

0 

—B 3 

B 2 

E 2 

B 3 

0 

—B 1 

E 3 

-B 2 

B 1 

0 


1 


\ 

/ 


\ 

) 


v 


(5.29) 


(5.30) 


The equation 
2 (B 2 - (B/c) 2 ) 
the factors of c. 


is 

if you include 


Searching for a Lagrangian for the electromagnetic field, we again look 
for a Lorentz scalar. One very suitable choice is 

F„ V F^ = 2 {B 2 - E 2 ). (5.31) 


The Lagrangian may then be written 

L = ~\ j^F^, 


(5.32) 


4 It’s useful to note that the components 
of the electric field may be extracted 
from F using 

E i = _ F 0i = F i0 ? ( 5 .33) 

and the magnetic field is given by 

B i = -e ijk F jk , (5.34) 

where e' l i k is the antisymmetric symbol 
and no sum over indices is implied in 
the last equation. 


where the choice of the factor —j will be explained later in the chapter. 4 

Finally we note a particularly important feature of electromagnetism 
is that electric charges are locally conserved. This fact is enshrined in 
the continuity equation relating the charge density p(x) and current 
density J(x), which is written 

% + v • J = = 0, (5.35) 

at 

where the electromagnetic current four-vector J^ = (p, J). The continu¬ 
ity equation prevents the destruction of charge followed by its creation 
at another arbitrarily distant point. Such a process would obey global 
conservation of charge, but would allow information to travel at an ar¬ 
bitrary velocity, which we know from special relativity is forbidden. 


5.3 Classical fields 

We will now take the continuum limit of a simple model from classi¬ 
cal mechanics and see how this leads to the concept of a classical field. 
We consider the case of a classical system of balls connected by springs 
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(whose quantum mechanics was examined in Chapter 2). In the contin¬ 
uum limit we are going to make the distance l between the balls shrink 
to zero and simultaneously let the mass in of each ball shrink to zero, 
while keeping the mass per unit length p = m/1 along the chain constant. 
The displacement qj (t) of the jth mass at time t becomes a continuous 
variable <fi(x, t) which depends on both spatial position x and time t. At 
each point x we have an independent degree of freedom <f>(x,t), so we 
say that the field has a ‘continuous infinity’ of degrees of freedom. The 
function (j){x 1 1) is one example of a classical field, but the concept can be 
much more general than this. For example, rather than describing just 
a simple displacement, our field can describe more complicated objects 
such as electromagnetic fields. In each case though, the field depends 
on position in spacetime. We can now make a working definition of the 
idea of a classical field: 5 

A classical field is a machine that takes a position in spacetime and 
outputs an object representing the amplitude of the field at that point. 
The output might be a scalar (in which case we refer to a scalar field), 
a complex number (a complex scalar field), a vector (in which case 
we refer to a vector field), a tensor or something more complicated. 

Fields are locally defined objects. The field ‘machine’ that we have de¬ 
scribed is like a computer subroutine which returns the value of <f> at the 
position x and time t that we’ve inputted. It is local in the sense that 
it doesn’t tell you about the value of <t> somewhere else. 


5 

We make the fussy assignment of a 
classical field here to distinguish it from 
a quantum field. As we will see shortly 
the latter inputs a position in spacetime 
and outputs an operator rather than an 
amplitude. 


Example 5.4 

The temperature T(x,t) is an example of a scalar field. The E(x,t) and B(x,t) 
fields of electromagnetism are familiar examples of vector fields, which could be 
time-dependent. The four-vector potential A^(x) is an example of a four-vector field 
(and here we have written it as a function of spacetime coordinates x u ). The object 
defined in eqn 5.28, F^ u (x), is an example of a second-rank tensor field. 


5.4 Lagrangian and Hamiltonian density 

The next task is to use this language of classical fields to formulate La- 
grangians and Hamiltonians. We will do this first for a simple example. 


Example 5.5 

In the discrete case considered in Section 2.4 we had a Hamiltonian given by 


H = T,§ n + \ K ^+ 1-®) 2 


(5.36) 
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and a Lagrangian given by 


l = J2 


=£-,*(« +1 -«) a - 


(5.37) 


In the continuum limit, these need to be replaced as the number of masses goes to 
infinity while i —> 0. The sums become integrals, so we make the substitution 

(5.38) 




Using the substitution in eqn 5.38 the kinetic energy term in H and L becomes 

1 /S lY - WWdtV. <»») 


E, ra 


dt 


dt 


where we’ve used the string density p = m/t. For the potential energy term we also 
make a rule for the differences in coordinates: these become spatial derivatives and 
hence 

0 Qj+ 1 -qj ) d<p(x,t) ( 540 ) 


t dx 

This means that the potential energy term becomes 

1 K f ( 

^2 \dx 

3 3 


E \ K («+i - ®) 2 - E \ Ke 


I dX 2 


T 


df\ 2 

dx ) 


(5.41) 


where we’ve replaced the spring constant K and the lattice spacing t with the ten¬ 
sion in the string T = Kl. Putting this all together (and generalizing to a three- 
dimensional system) gives a Hamiltonian 

'l fd<j>^ 2 
dt 


H = d*x 


and a Lagrangian 


/■ 

/< 


2 P 




(5.42) 


L= d^# 


Ml ) -) T < W > 2 


(5.43) 


Our expressions for H and L are integrals over a position coordinate 
and so we can define a Hamiltonian density 7 H by 


H = J d 3 xH, 

and similarly a Lagrangian density £ as 


L = / d 3 x£. 


(5.44) 


(5.45) 


In general, both the Hamiltonian density Tt and the Lagrangian density 

£ are functions of <(>. <fi and <l> . We can define a conjugate momentum 

6 Just like pi = —. For a careful jus- 7r(;r) in terms of the functional derivative 6 
aqi 

tification of the last equality see Wein- A T, r)C 

berg, Chapter 7. k{x) = —r = —r, (5.46) 

Of Of 

7 In Example 5.5 the particular expres- and then this implies that TL and £ can be related by 7 

7 i. = 7 icj) — £. 


sions are: 


£ =MS) -5 t(v « S 


1 


d<t>V 


1 


and 


8<j> 

7T = p -. 

H dt 


(5.47) 

Now recall from eqn 1.34 that the action S is related to £ by the four¬ 
dimensional integral S = J d 4 x £(</>, £(,//>), and the action principle, given 
by 6S = 0, implies that (eqn 1.35) 

3£ Wo. (5.48, 


dC » 
di~ d “ 


d(d^) 


You can check that eqn 5.47 holds. 


the four-vector version of the Euler-Lagrange equation. 
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Example 5.6 

For the electromagnetic field, the Lagrangian density is written 

C = -H liv F» v =S' : ;i(-E 2 - B 2 ). (5.49) 

In the absence 8 of an electric potential (V = 0), we have £ = \(E 2 — B 2 ) = 
4(A 2 — B 2 ). The conjugate momentum is 7r* = dC/d(doAi), and so 7r = — A = E, 
so that 

H = TrMi - C = i(£ 2 + B 2 ). (5.50) 

The Euler-Lagrange equation (eqn 1.35) gives 


dc _ ( ac \ 

9^ A U(a A A M )J 


= 0 . 


(5.51) 


In SI units £ = — -r^F uu FV ,l/ and so 

1 9 ! 


^ Recall eqn 5.27 for the relationship be¬ 
tween fields and potentials. 


In SI units H = + |e 0 E 2 . 


The first term in this expression is zero since £ = — ^Fy V F^ u contains only deriva¬ 
tives of Ay,. The second term can be rewritten as 


3 X F X » = 0, 


(5.52) 


which is a compact way of writing two of Maxwell’s equations in free space, namely 
V • E = 0 and V X B = E. The current density four-vector = (p, J) can 
also be included, and couples linearly to the electromagnetic potential, and so the 
Lagrangian becomes 

C = -H llv F> iV - J^Ap. (5.53) 

In this case 



(5.54) 


and so 

= J", 


(5.55) 


which generates the other two Maxwell equations: V • E = p and V x B = J + E. 


Ill SI units d\F x e = no J M . 


Solving the Euler-Lagrange equation will produce all the allowable 
modes of oscillation and hence the allowed wave vectors, the particu¬ 
lar values k n , that may occur without dissipation. By the principle of 
superposition the most general wave is one made up of a weighted sum 
of all of the possible modes 

<f>(x,t) = Y^a kn e- i{wt - kn ' x) . (5.56) 

k n 


Thus we have seen how many classical continuous systems may be ex¬ 
pressed in a Lagrangian formalism so that the Euler -Lagrange equation 
allows us to derive the normal modes. Our experience with the simple 
harmonic oscillator in Chapter 2 motivates the idea that these normal 
modes are simply harmonic oscillators themselves and so can be quan¬ 
tized, leading to particle-like solutions. For the electromagnetic field, 
those quanta are of course photons. 9 Now with this classical field the¬ 
ory under our belt, we are ready to return to the quantum world, and 
in the next chapter we will make our first stab at relativistic quantum 
mechanics. 


9 We will put this on a firmer footing in 
Chapter 14 and show in detail liow pho¬ 
tons emerge from quantizing the classi¬ 
cal electromagnetic field. 
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Chapter summary 

• A field takes a position in spacetime and outputs an object repre¬ 
senting the amplitude (be it scalar, vector or complex) of the field 
at that point. 

• The Lagrangian and Hamiltonian density are related by Ti = nefi—C 
where 7r = dC / d<f> is the generalized momentum. 

• The Lagrangian for the electromagnetic field is C = — jF^F^, 
where F^ v = <9 M A„ — d v A^ is the electromagnetic field tensor. 


Exercises 


(5.1) If the Lagrangian does depend explicitly on time, 
then 


d L _ dL dL . dL .. 

df dt dxi Xt dii X1 


(5.57) 


In this case show that 
dL 
dt 


AH 

dt 


(5.58) 


(5.2) Show that Poisson brackets anticommute 


{A, B} pb = — {B, A} pb , (5.59) 

and also satisfy the Jacobi identity 
{{A,B} 

PB > C}pB + {{C> PB J b} pb ,, fim 

+ {{B,C} pb ,A} pb = 0, j 

and show that quantum mechanical commutators 
also have the same properties. 

(5.3) Show that the commutator of two Hermitian oper¬ 
ators A and B is anti-Hermitian, i.e. that 


[A, B]' 1 =-[A, B\. (5.61) 


(5.6) Use the Lagrangian L = + qA ■ v — qV for 

a free particle of charge q and mass m in an elec¬ 
tromagnetic field to derive the Lorentz force, i.e. 
show 

^(ymu) = q(E + v x B). (5.62) 

[Hint: V(o b) = (a ■ V )b + (b ■ V)a + b x curl a + 
a x curl b.] 

(5.7) Show that for the Lagrangian 

—m.c 2 

L = -b qA ■ v — qV, (5.63) 

7 

when v -C c the momentum becomes p = mv + qA 
and the energy becomes 

E = me 2 + -^—(p — qA) 2 + qV. (5.64) 
2m 

(5.8) Another Lorentz scalar that can be obtained from 
Fpv is 

e afh, *F a f,F yS . (5.65) 


The factor of i in many commutator expressions 
(e.g. [x,p] = i h, [L x ,L y ] = ihL z , and [A,B] = 

{A, -B} pb ) makes sure that this property is 
obeyed. 

(5.4) The Lagrangian for a free particle is L = —me 2 /y. 

Find an expression for L, p and H when v -C c. 

(5.5) Show that f b ds takes its maximum value when the 
integration is done over a straight world-line be- (5.10) 
tween a and b [NB ds 2 = c 2 df 2 — d* 2 — Ay 2 — dz 2 ]. 


Show that this leads to E ■ B being an invariant. 

(5.9) Show that d tl F /JI ' = J 1 ' reduces to two of Maxwell’s 
equations, and that the other two can be generated 
from the equation 

d\F + d^F v a + d„F\n = 0. (5.66) 

Show that dpd a F a ® = 0 and hence that d^ J M = 0. 
Interpret this as a continuity equation. 




A first stab at relativistic 
quantum mechanics 



Bohr: What are you working on? 

Dirac: I’m trying to get a relativistic theory of the electron. 

Bohr: But Klein has already solved that problem. 

(Conversation at the 1927 Solvay Conference.) 

We now take a break from the formalism of classical mechanics and turn 
back to quantum mechanics. In this chapter, we will attempt to find 
a quantum mechanical equation of motion that is properly relativistic. 1 
This will lead to an equation known as the Klein-Gordon equation, 
though in fact it was derived first by Schrodinger, who rejected it because 
it had what seemed like a fatal flaw (which we will explore later); he 
then went on to a second attempt at the problem and produced what 
is known as the Schrodinger equation, which works brilliantly but is 
non-relativistic. 


6.1 The Klein—Gordon equation 


6.1 The Klein—Gordon equation 
59 

6.2 Probability currents and den¬ 


sities 61 

6.3 Feynman’s interpretation of 
the negative energy states 61 

6.4 No conclusions 63 

Chapter summary 63 
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1 It , s a first stab because the right an¬ 
swer is due to Dirac, and we will deal 
with that later. This chapter is about 
the attempt introduced by Klein and 
also by Gordon, which is why in the 
opening quote Bohr didn’t think Dirac 
was working on something worthwhile. 


We first review the logical chain of ideas which leads to the Schrodinger 
equation. In the non-relativistic quantum mechanics of a free particle, 
we start with the dispersion relationship, which links the energy and 
momentum of the particle using the simple expression for kinetic energy 
E = 2 m- We turn the variables E and p into operators (E —> E,p —^ p), 
and then we make the substitutions 2 E = iand p = — ifiV. This 
then leads to the Schrodinger equation 


■ j-d(f>(x, t) 
dt 


2m 




( 6 . 1 ) 


2 

Just to make some of the equations a 
bit more familiar, we will briefly rein¬ 
troduce all the factors of h and c. 


where the object that the operators act on is <f>(x, t), the wave function. 

The free-particle solutions to the Schrodinger equation are plane waves 
of the form <p(x, t) = iVe -I (“* -fc ' !E ), where N is a normalization constant. 
We can write this solution in four-vector form <j>(x) = Ne~ lp ' x . We’ll 
call a wave with this sign combination in the exponential an incoming 
wave. Operating on it with momentum and energy operators yields 


p<j)(x,t) = —ifiV(./Ve l( - a ’ t fc ' x )) = hk(j)(x,t), 

E<j)(x,t) = ifi^(A f e _1 ^ wt ” fea5 ^) = hco(j)(x,t). 


( 6 . 2 ) 
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Oskar Klein (1894-1977) 
Walter Gordon (1893—1939) 


^Note that d 2 = — V 2 . 


That is, an incoming wave has a positive momentum and energy. 

Now, to get a relativistic wave equation we’ll try the same trick. For 
a relativistic particle we have the dispersion relationship 

E = {p 2 c 2 + ra 2 c 4 )2, (6.3) 

and by making the same operator substitutions as we had before (E —> 
E = ifij^,p —■> p = —iftV) we obtain 

\h^~ = (—fi 2 c 2 V 2 + m 2 c 4 )5(/). (6.4) 

at 

This has two big problems. Number one: it doesn’t look covariant (the 
spatial and temporal derivatives appear in different forms). Number 
two: how do we cope with the square root sign which seems to imply 
that we take the square root of a differential operator? 

Instead of tackling these problems we take a side-step. To avoid the 
square root we’ll simply square the dispersion relation and start again. 
The squared form of the dispersion is E 2 = p 2 c 2 + m 2 c 4 . Making the 
same operator substitutions as before, we obtain 

— h 2 — ^ = (— H 2 c 2 V 2 + m 2 c 4 )4>(x,t). (6.5) 

This equation of motion for a wave function is called the Klein—Gordon 
equation. To make things as easy to read as possible we’ll revert to 
using natural units where H = c = 1 from now on, and this gives us the 
following form of the Klein-Gordon equation: 

_ d <H-M) = (_v 2 + m 2 )<j>(x,t). (6.6) 

The Klein-Gordon equation fits in nicely with our relativistically covari¬ 
ant language, since we can write 

(<9 2 + m 2 )(j){x) = 0, (6.7) 

thereby amalgamating the space and time derivatives. 3 


^ In natural units k = p and to = E 
and since we’re mostly interested in the 
energies and momenta of particles we’ll 
use p and E. 


Example 6.1 

Let’s solve the Klein-Gordon equation for a free particle. We’ll try a solution 4 
< p(t,x ) = N e ~ lEt + 1 P' x — Ne~ ip ' x . Upon substituting we find that </>( x) is indeed a 
solution, yielding the expected dispersion 

E 2 =p 2 + m 2 . (6.8) 


This looks fine, until we realize that we have to take the square root to get the energy 
which gives E = d =(p 2 +m?). We might be tempted to just throw away the negative 
solutions as unphysical, but in fact the appearance of negative energy solutions is 
going to be telling us something. 
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6.2 Probability currents and densities 


One of the reasons that Schrodinger wasn’t happy with the Klein 
Gordon equation after he’d derived it was that something rather nasty 
happens when you think about the flow of probability density. The 
probability of a particle being located somewhere depends on <jf {x)(j){x) 
and so if this quantity is time-dependent then particles must be sloshing 
around. The probability density p and probability current density 5 j 
obey a continuity equation 

^ + V-i = 0, (6.9) 

at 

which is more easily written in four-vector notation as 


V = o. 


( 6 . 10 ) 


If, as is usual in non-relativistic quantum mechanics, we take the spatial 
part to be 

j(x) = -i [(/>*(x)~V4>(x) - cj)(x)V (j>* (x )\, (6.11) 


then, for eqn 6.10 to work,' we require the probability density to look 
like 8 

<t> W—di - ~^ x > ' 


p(x) = 


dt 


( 6 . 12 ) 


The resulting covariant probability current for the Klein-Gordon equa¬ 
tion is then given by 


f{x) = 1 {0*( x)d^(x) - [d^*(x)} 00c)} , (6.13) 

which, as the notation suggests, is a four-vector. Substituting in our 
wave function <j>(x) = Ne~ lp ' x gives the time-like component of the prob¬ 
ability current as 

j° = p = 2\N\ 2 E. (6.14) 

Since E can be positive or negative we can’t interpret p as a probability 
density, since there’s no such thing as negative probability! This seems 
to ring the death knell for the Klein-Gordon equation as a half-decent 
single-particle quantum mechanical equation. 


6.3 Feynman’s interpretation of the 
negative energy states 


Fortunately, all is not lost. Richard Feynman 9 and, independently, Ernst 
Stuckelberg 10 came up with an interpretation of the negative energy 
states as particles moving backward in time. We call these states an¬ 
tiparticles. Consider the classical equation of motion for a charged 
particle in an electromagnetic held which looks like 


TO 


d 2 x p 
dr 2 


qF^ 


dx v 

dr 


(6.15) 


5 

Here j is the current density of par¬ 
ticles, whereas in Example 5.6 we used 
J as the current density of charge. If 
both appear together, as they will do 
in Section 6.3 below, we will write the 
former as j and the latter as J em - 

^ In non-relativistic quantum mechan¬ 
ics, j is defined with some additional 
constants: 

j = V</> - ip'Vi/j*). 

We dispense with the h/2m constant 
here. 

•7 

It will work, and you can prove it as 
follows. Take the Klein-Gordon equa¬ 
tion (eqn 6.5) and premultiply it by 
<j>*. Then take the complex conjugate 
of eqn 6.5 and premultiply by </>. Sub¬ 
tracting these two results will give an 
equation of the form of eqn 6.9 with j 
and p as given. 

^ Note that this is very different 
from non-relativistic quantum mechan¬ 
ics where p = \ip\ 2 - 

9 Richard Feynman (1918-1988) in¬ 
vented much of the physics in this book. 
The bongo-playing theoretical physicist 
once defined science as ‘the belief in the 
ignorance of experts’. He was never sat¬ 
isfied by an explanation until he had 
worked it out for himself, and it is per¬ 
haps fitting that his quirky shorthand 
for doing quantum field theory calcu¬ 
lations, the little cartoons we now call 
Feynman diagrams, decorate the pages 
of every book on quantum field the¬ 
ory, including this one. (He also noted 
that bongo players didn’t feel the need 
to point out that he was a theoretical 
physicist.) 

19 Brilliant, eccentric and difficult, the 
Swiss mathematician and physicist 
Ernst Stueckelberg (1905-1984) may 
also claim credit for the invention of 
a significant amount of the physics in 
this book, although the opaqueness of 
many of his arguments and his insis¬ 
tence on inventing novel notation have 
prevented this from being widely recog¬ 
nized. His achievements include: this 
interpretation of antiparticles; the in¬ 
vention of spacetime (‘Feynman’) dia¬ 
grams; the explanation of (‘Yukawa’) 
force-carrying particles; the discovery 
of the renormalization group and the 
conservation of baryon number. See 
Schweber, Chapter 10 for more details. 
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11 Incoming implies inclusion of a factor 
e~ ipx . 


BEFORE AFTER 



Fig. 6.1 (i) An incoming particle (en¬ 
ergy E, charge e (= |e|), momentum 
p) is absorbed by a system (energy £, 
charge Q, momentum q ). Also shown 
are (ii) emission of particle, (iii) absorp¬ 
tion of a particle in a negative energy 
state and (iv) emission of an antiparti¬ 
cle in a positive energy state. 


where r is proper time, q is charge and F^ v is the electromagnetic field 
tensor. Changing the sign of the proper time r has the same effect as 
changing the sign of the charge. Therefore, a particle travelling back¬ 
ward in time looks the same as an oppositely charged antiparticle mov¬ 
ing forward in time. One strategy to eliminate negative energy states 
is therefore to reverse the charge and momentum of all negative energy 
solutions, turning them into antiparticles. How does this eliminate the 
negative sign of the energy? Since we write the phase of the wave func¬ 
tion as Et — p ■ x, then making the substitution t —> —t means that we 
have to change —E —> E for the first term to make sense. Of course 
reversing the time (like playing a film backwards) reverses all momenta 
so we also need to swap p —> — p for consistency. 

We can go further by invoking some quantum mechanics. Let’s ex¬ 
amine the electromagnetic current density for a Klein-Gordon particle 
which we will define as = qj M where q is the charge of the particle. 
The current four-vector J^ m for a positively charged, incoming particle 11 
with positive energy is given by 

JL = (+«)2|JV|V (6.16) 

= (+q)2\N\ 2 (E,p). (6.17) 

In contrast, a negative energy particle with positive charge has = 
(+q)2\N\ 2 (— E,p ). Since we’ve decided that negative energies are dis¬ 
gusting because they mess up the probability, we want to turn the energy 
positive. We do this by taking the minus sign out to the front of the 
equation to yield 

J£ n = (-q)2\N\ 2 (E,-p). (6.18) 

Notice that we have to swap the sign of charge and the three-momentum 
p 1 changing an incoming particle into an outgoing antiparticle, in agree¬ 
ment with our argument above. 

We now have a rule to turn unphysical negative energies into physical 
positive energies. Simply treat the particle as one with positive energy, 
but change the particle into an antiparticle by reversing the charge and 
swapping the sign of the three-momentum. 

Is this really permissible for thinking about interacting quantum par¬ 
ticles, which we know from quantum mechanics can be created and de¬ 
stroyed and emitted and absorbed? It is, as we’ll see from Fig. 6.1. 
In a typical process we might have an incoming particle absorbed by 
a system, donating its energy, momentum and charge to the system 
[Fig. 6.1 (i)]. Alternatively a particle could be emitted, depleting the 
system of energy, momentum and charge [Fig. 6.1 (ii)]. The key point is 
that absorption of a particle in a negative energy state [Fig. 6.1 (iii)] is 
equivalent to the emission of an antiparticle in a positive energy state 
[Fig. 6.1(iv)], at least as far as the system is concerned. 

To summarize then, although the dispersion relation E = ± \/p 2 + m 2 
admits positive and negative solutions, only positive energies are physi¬ 
cal. We therefore make all energies positive, but we can’t just ignore the 
formerly negative energy states. The formerly negative energy states are 
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12 

The assignments ‘incoming’ and ‘out¬ 
going’ are made to link up with the 
construction of Feynman diagrams in 
Chapter 19. 


This general solution is therefore not an inherently single-particle de¬ 
scription, because we are forced to consider particles and antiparticles. 

6.4 No conclusions 

Although the Klein-Gordon equation has now been made semi¬ 
respectable with Feynman’s interpretation it still has an uncomfortable 
feeling about it. What is implied by the new interpretation? Does it 
correspond to anything sensible in real life? We’ll leave these questions 
for a moment. The Klein-Gordon equation will return later and we will 
find that it is, in fact, a perfectly respectable equation but in a slightly 
different context, namely that of describing the dynamics of fields whose 
excitations are spinless particles. Our next step is to systematize our 
procedure for writing down theories. 


interpreted as positive energy antiparticles with momenta in the opposite 
direction to the corresponding particle. A general solution to the Klein 
Gordon equation with a particular positive energy now corresponds to 
a superposition of two states 12 



Incoming positive 


Outgoing positive 

4>{x) = 

energy particle 

+ 

energy antiparticle 


ex e ~K Et ~P x ) 


a e +i (Et-p x) 


Chapter summary 

• The Klein-Gordon equation is ( 8 2 + m 2 )(f> = 0. 

• It suffers from having negative energy states and negative proba¬ 
bilities. 

• We can think of these negative energy states as positive energy 
antiparticles 


Exercises 


(6.1) This problem looks forward to ideas which we will 
consider in more detail in the next chapter. Show 
that the Lagrangian density C given by 

C = \ - lm 2 <t> 2 , ( 6 . 20 ) 


yields the Klein-Gordon equation when using the 
Euler-Lagrange equation. Also derive expressions 
for the momentum n = dC/d(j> and the Hamilto¬ 
nian density Tt = n(j> — C for this Lagrangian and 
interpret the result. 
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1 Lagrangian density is a bit of a mouth¬ 
ful, so we are using the universal short¬ 
hand of referring to Lagrangian densi¬ 
ties simply as ‘Lagrangians.’ Pedants 
beware. 


2 n 

Symmetry will turn out to be essen¬ 
tial to our understanding of field the¬ 
ory. However, we postpone its discus¬ 
sion until Chapter 10. 


I regard as quite useless the reading of large treatises of pure 
analysis: too large a number of methods pass at once before 
the eyes. It is in the works of applications that one must 
study them; one judges their ability there and one apprises 
the manner of making use of them. 

Joseph-Louis Lagrange (1736-1813) 


Theoretical physics is all about writing down models to describe the 
behaviour of particular systems in the Universe. The insight that we 
have gained from the previous chapters is that the main starting point 
for a field theory is writing down an appropriate Lagrangian. 1 This is 
such an important step in the process that we are devoting an entire 
chapter to it, partly to reiterate some of the ideas introduced earlier and 
partly to demystify the act of plucking a Lagrangian out of thin air as 
practised by many professional theorists. 

Why do we choose a particular Lagrangian £? Usually because it’s 
the simplest model that contains all the key physical ideas and has the 
correct symmetry for the problem. 2 Its artful construction allows it to 
spit out the right equation of motion once fed into the waiting jaws of 
the Euler-Lagrange equation, given by 


dc (dc\ 

d<t> Hwoy 


(7.1) 


Then, the wavelike solutions of this equation of motion can be inter¬ 
preted as particles with the appropriate dispersion relation, i.e. the rela¬ 
tion between energy and momentum. This process of upgrading classical 
modes into quantum particles will be described in detail in the next part 
of the book. Here we will just assume that it can be done somehow. 

In this chapter, we will examine a succession of simple examples of 
choices of Lagrangian, for massless and massive scalar fields, a scalar field 
coupled to an external source, a couple of scalar fields and a complex 
field (deferring vector and spinor fields to later in the book). 


7.1 A massless scalar field 

o 

The assignment of masslessness will be 

explained shortly. We start with the simplest possible Lagrangian, one for a massless 3 




7.2 A massive scalar field 65 


scalar field 4 4>{x). The scalar field assigns a scalar amplitude to each 
position x in spacetime. It can change as a function of spacetime co¬ 
ordinate and its gradient is d p (j> = d(f>{x) / dx^ . The Lagrangian will 
depend only on the rate of change of <fi in time do4> and in space V0. 
We have to make a choice of C which is relativistically covariant and so 
we choose 

C=\d^d^=-{d^f. (7.2) 

This can be expanded as £ = ^(do(fi) 2 — • V0, which has the look 

of 5 C = (kinetic energy) — (potential energy). The factor of \ is included 
so that later results come out nicely, as we shall see. 


Example 7.1 

Using the Lagrangian in eqn 7.2, we have 


dC 

8<j> 


dC 

d(d p <t>) 


= 


and hence plugging into the Euler-Lagrange equation (eqn 7.1) we have 


d p d^(j) = 0 . 


This is the wave equation d 2 <j> = 0, or 




and has wave-like solutions 


<f>(x,t) = J2 a P e ~ i(E ,pt_p ' x) , 


with dispersion relation E p = c\p\ [though in our units c = 1], see Fig. 7.1. 


(7.3) 


(7.4) 


(7.5) 


(7.6) 


Note that E p = 0 at \p\ = 0 and so the dispersion relation is gapless. 
If we take this equation to describe the energies of the particles that 
will (later) represent quantum excitations of this system then, since this 
is the m — 0 version of the relativistic dispersion E p = \J p 1 + m 2 , 
we call the particles massless and the field a massless scalar field. The 
wave solutions of this linear equation obey the principle of superposition. 
We say that the particles which correspond to these waves are free or 
non-interacting. If we send these particles towards each other they will 
simply pass on through each other without scattering. 

7.2 A massive scalar field 

The next step is to try and include mass in the problem. To do this, we 
make C depend not only on d p (f> (how the scalar field changes through 
spacetime) but also on 4> (the magnitude of the scalar field). This can 
be done by introducing a potential energy term U{4>) (and since La- 
grangians are kinetic energy minus potential energy, this will come with 
a minus sign). The potential energy term expresses the cost of having a 


4 Throughout this book we assume the 
scalar field <j>{x) outputs a real number 
rather than a complex one. The com¬ 
plex scalar field il>{x) will be discussed 
separately. 


®Note that 4 (d,,8)‘ 2 is often called the 
kinetic energy as a shorthand. 



P 


Fig. 7.1 The dispersion relation E p = 
\p\- 
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/E P = |p| 


P 


Fig. 7.2 The dispersion relation E p = 
\Jp 2 -\- m 2 for a massive particle. 


shorthand version of the same equa¬ 
tion is 

( d 2 + m 2 )(f)(x) = J(x). 


field there at all, rather than just vacuum. By choosing U(<t>) oc </> 2 we 
will have a quadratic potential energy with a minimum at 4> = 0. Thus 
we write down 

£ = \ (<9^) 2 - \m 2 4 > 2 , (7.7) 

where the |to 2 factor is chosen to make the next bit come out right. 
The parameter m has yet to be shown to be anything physical, but will 
turn out (of course) to be the particle mass. 


Example 7.2 

Using the Lagrangian in eqn 7.7, we have 


dC 

d<f> 


—m 2 (j>, 


dC 


= 


and hence plugging into the Euler-Lagrange equation (eqn 7.1) we have 


(<9 M 9 M + m 2 )<f> = 0. 


(7.8) 


(7.9) 


The equation of motion for this field theory is the Klein—Gordon equation! The 
solution of these equations is again if>(x,t) = ae~ l ^ E ’P t ~ p a! \ with dispersion E 2 = 
p 2 + m 2 (see Fig. 7.2). 


Obviously if m = 0 we revert to the case of Section 7.1. With ?n / 0 we 
have a gap in the dispersion relation (at p = 0, E p = ±m) corresponding 
to the particle’s mass. Again, the equations of motion are linear and 
therefore the particles described by this theory don’t interact. 


7.3 An external source 

We now want to introduce interactions. The simplest way to do this is to 
have the scalar field interact with an external potential. This potential is 
described by a function known as a source current J(x ) which interacts 
with the held, giving a potential energy term —J(x)<j>(x). The resulting 
Lagrangian is written 

£ = \[9Mx)} 2 - ^m 2 [0(a;)] 2 + J(x)<t>(x). (7.10) 

The equations of motion become 6 

(<9 m <9 m + m 2 )<t>(x) = J{x). (7.11) 

This is now an inhomogeneous partial differential equation. The use of 
sources (or sorcery) will later turn up as a method of probing quantum 
fields following a technique invented by Julian Schwinger. 
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7.4 The 0 4 theory 


How do we get particles to interact with other particles (or, equiva¬ 
lently, fields to interact with other fields)? Here’s the simplest recipe. 
The <j > 4 Lagrangian is like the scalar case, but with an extra potential 
term U((f>) proportional to <(> 4 . This term makes the theory interacting. 
Unfortunately it also makes it unsolvable. As we’ll see, we have to use 
a sort of perturbation theory to make predictions from this theory. The 
Lagrangian is 7 

C = (7.12) 

which leads to the (not-very-memorable) equation of motion 


^The numerical coefficient of the </> 4 
term is often chosen with convenience 
in mind. We will use several in the 
course of the book, but the coefficient 
will be most common. 


(<9 2 +m 2 )(j) = (7-13) 

The constant of proportionality A is known as the interaction strength. 
Clearly, when A = 0 we return to the massive scalar field Lagrangian 
and the fields don’t interact. 


7.5 Two scalar fields 


Here’s another way to make particles interact. Why not have two differ¬ 
ent types of particle in the Universe, described by two fields <f>i{x) and 
<j> 2 (x)? In our simple theory they have the same mass and will inter¬ 
act with themselves and each other via a potential energy U(<j> 1 ,^ 2 ) = 
g (<t>\ + $ 2 ) where g is the interaction strength here. Notice that mul¬ 
tiplying out the bracket gives self-interacting <j> 4 terms and, crucially for 
us, a cross-term 2which forces the two types of field to interact. 
The resulting Lagrangian density is 

£ = ^(d/^1) 2 - + g(<t>l + <f>l ) 2 . 

(7.14) 


We notice that we get the same Lagrangian as in eqn 7.14 even if we 
change our definition of the fields. (We say that the Lagrangian has a 
symmetry.) If we transform the two fields using the mapping <f>i —> </>( 
and 4>2 —> </>2 where 


V 2 




(7.15) 


then the Lagrangian is unchanged. This looks a lot like we’ve rotated 
the fields in (fri-fo space. We say that the particles have an internal 
degree of freedom. It’s as if they carry round a dial labelled by 0 and 
by turning this dial (i.e. changing 9) a (f >i particle may be turned into 
a superposition of </>i and </>2 particles. The invariance of the physics 
(via the Lagrangian) with respect to rotations by 6 in the </>i-02 plane 
expresses an 50(2) symmetry. 8 


8 SO(2) is the two-dimensional special 
orthogonal group, corresponding to the 
group of 2 x 2 orthogonal matrices 
with unit determinant. This is a con¬ 
tinuous symmetry and we will later 
show that continuous symmetries lead 
to conserved quantities. More of this in 
Chapters 13 and 15. 



68 Examples of Lagrangians, or how to write down a theory 



Real 



Real 


Fig. 7.3 The complex scalar field on 
an Argand diagram. 


7.6 The complex scalar field 

We can also make a transformation that simplifies the two-scalar-fields 
Lagrangian considered in the previous section. We define two new fields 
if and ?/d given by 


ijfi 


1 

75 

i 

75 


[4>i + 77 . 


[<t> 1 - i<7- 


(7.16) 


This transformation can be pictured by an Argand diagram with <f>i 
along the real axis and <f >2 along the imaginary axis [Fig. 7.3(a)]. Using 
the new fields we obtain a new Lagrangian 


C = d^ip^d^ip ~ — g(ip^ip) 2 - ( 7 - 17 ) 


This is known as a complex scalar field theory. Although we originally 
made it up from two sorts of field, we can imagine that it describes 
one sort of complex-valued field xf. This new field is constructed from 
a bit of (f >i and a bit of (f >2 and retains two degrees of freedom. The 
new Lagrangian (eqn 7.16) is invariant with respect to rotations in the 
complex plane [Fig. 7.3(b)] 


if —■> tfe 101 , 7 —> e ia ij)\ 


(7.18) 


9 C/(1) is the one-dimensional group of which expresses a U( 1) symmetry. 9 
unitary transformations. Mathemati¬ 
cally we note that SO (2) is isomorphic 

to C7(l) (see Fig. 7.4), so nothing has i- 

really changed at a fundamental level. ' - 




Fig. 7.4 The group 50(2), describ¬ 
ing rotations in two-dimensions, is iso¬ 
morphic to 1/(1), the one-dimensional 
group of unitary transformations de¬ 
scribed by the phase of the complex 
number ^ = l^le 1 ^. 


Example 7.3 

The i/d?/></> theory: Finally, as a rather more complicated example, we consider a 
theory with three types of particle. We add together Lagrangians for the complex 
scalar field theory (with mass m) and for the scalar field theory (with mass g ,) and also 
arrange for the 0 and ^ particles to interact via a potential gif)' The Lagrangian 
is 


jC = 

+ \(.dr<f) 2 - \h 2 <t> 2 - gil>H<l>- (7.19) 

As we will see a little later, this theory looks a lot like quantum electrodynamics! 


The fields in this chapter have all been classical and although we’ve 
claimed that particles are the excitations of these fields we have yet to 
see how they arise from quantum mechanical versions of the fields. This 
is the subject of the next part of the book. 
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Chapter summary 

• Lagrangians are deduced for a massless scalar field, massive scalar 
field, the inclusion of a source, two scalar fields and a complex field. 

• For example, the Lagrangian for a massive scalar field coupled to 
a source is 

£ = \[d^4>{x)f - ^m 2 [(p(x)] 2 + J(x)(j>(x). 

• The Euler-Lagrange equation can be used to deduce an equation 
of motion. 

• For example, the equation of motion for a massive scalar field cou¬ 
pled to a source comes out to be 

(■ d 2 + m 2 )(p(x) = J(x). 

• The complex scalar field xp = ^ 75(^1 + i</> 2 ) possesses a 17(1) sym¬ 
metry. 


Exercises 


(7.1) For the Lagrangian C given by 

1 1 00 

C = -d^dA - j" E X n<p 2n+2 , (7.20) 

71=1 

show that the equation of motion is given by 

OO 

(d 2 + m 2 )(j> + A "( 2n + 2 )(p 2n+1 = 0. (7.21) 

71=1 

(7.2) Consider a massive scalar field (p{x) coupled to 
a source J(x), described by the Lagrangian of 
eqn 7.10. Show that the equations of motion are 
those of eqn 7.11. 

(7.3) Show that the equations of motion following from 
the Lagrangian in eqn 7.14 are the coupled equa¬ 
tions 

+ m 2 (pi + 4gcpi(<pl + (pi) = 0, (7.22) 

dpctfa + m 2 (p 2 + Ag(p 2 {(pl + (pi) = 0. (7.23) 


(7.4) Show that eqn 7.7 is equivalent to 



I(V0) 2 -im 2 0 2 . 


Show that 



and 


Define the quantity 


IE = 


d£ 

d(d„(p) ’ 


and show that IE = and II 0 = n = (p. 


(7.24) 


(7.25) 


(7.26) 


(7.27) 





Part III 


The need for quantum 
fields 


In this part we introduce quantum fields, how to construct quantum field 
theories, and look at the transformations that can be applied to them. 
Understanding the underlying symmetries of a quantum field turns out 
to be crucial for exploring the properties of quantum field theory. This 
part is structured as follows: 

• To study quantum fields, we must first describe how to include 
time dependence, which is covered in Chapter 8. In doing this, 
we inadvertently stumble on a reason why single-particle quantum 
mechanics is insufficient for fully describing the world in which we 
live. 

• In Chapter 9 we discuss how to describe the response of quantum 
mechanical states to transformations: translations, rotations and 
Lorentz boosts. 

• This sets up the treatment of Noether’s theorem in Chapter 10, 
demonstrating the fundamental relationship between symmetry 
and conservation laws. 

• We are then ready in Chapter 11 to plunge into the deep waters of 
constructing quantum field theories out of classical field theories, a 
manufacturing process termed canonical quantization. We give an 
extended example of this process for complex scalar field theory in 
Chapter 12 

• These ideas are extended to multicomponent fields in Chapter 13 
and we work this through for the case of massive electromagnetism, 
thereby studying a spin-1 vector field. 

• Transformations so far have been global. When examining local 
transformations we find ourselves introducing gauge theory and 
this is covered in Chapter 14. 

• In Chapter 15 we cover discrete symmetries: C, P and T. We look 
at the connection between group theory and the symmetries we 
have been studying in Part III. 
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1 Single-particle quantum mechanics, as 
the name suggests, describes the quan¬ 
tum behaviour of a single particle con¬ 
fined in some potential. As we will de¬ 
scribe, even in such a starting situation 
it is possible to have to worry about 
the appearance of particle-antiparticle 
pairs, and thus the assumption that 
‘you are just dealing with a single par¬ 
ticle’ is not as straightforward as you 
might think. 

2 This is of course named after Austrian 
physicist Erwin Schrodinger (1887— 
1961). 


The passage of time 


The innocent and the beautiful 
Have no enemy but time 
W. B. Yeats (1865-1939) 

He that will not apply new remedies must expect new evils; 
for time is the great innovator 
Francis Bacon (1561-1626) 

There is more than one way of putting time into quantum mechanics. 
In this chapter we’ll discuss Heisenberg’s version of quantum theory in 
which all the time dependence is located in the operators, while the 
wave functions stay fixed and unchanging. This contrasts with the more 
familiar version that Schrodinger formulated in which the wave func¬ 
tions carry the time dependence, the formulation that is taught in most 
introductory courses. Far from being a dry exercise in formalism, this 
will allow us to kill single-particle quantum mechanics 1 stone-dead and 
guide us in a direction towards quantum fields. 


8.1 Schrodinger’s picture and the 
time-evolution operator 

The most familiar way of doing quantum mechanics is to think about 
time-dependent wave functions. The manner in which the wave function 
changes with time is given by the Schrodinger 2 equation 

, d-0OM) = #^(3, ; t), (8.1) 

which is an equation of motion for the wave function. Dynamic variables 
like momentum and position are accessed with operators p = — iV and 
x = x respectively, which act on the wave function. This way of doing 
quantum mechanics is known as the Schrodinger picture and is com¬ 
pletely different from classical mechanics, where the dynamic variables 
depend on time [e.g. p{t),x(t), etc.] and are themselves described by 
equations of motion. The root of this difference is that time itself is 
a funny thing in quantum mechanics. There is no operator that tells 
us when some event occurred; instead time is a parameter that in the 
Schrodinger picture is located purely in our wave function 

Although no operator tells us the time, there is an operator Z7(f 2 ,ii) 
that can evolve a particle forward through time, from time t± to i 2 . Thus 
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we can write 

ipfo) = U{t 2 ,t 1 )ip(t 1 ), (8.2) 

and we will call U(t. 2 ,ti) a time-evolution operator. Notice that the 
time-evolution operator has two slots in which times can be inserted: 
the start-time t\ and the stop-time t 2 - 

The time-evolution operator has five very useful properties: 

( 1 ) = 1 . 

If the two times are the same, then nothing can happen and U is 
just the identity operator. 

(2) U(t 3 , t2)U(t 2 , t\) = U(ts,ti). 

This is the composition law and shows that we can build up a time 
translation by multiplying a set of smaller translations that do the 
same evolution, but in steps [see Fig. 8.1(a)]. 

(3) i^-t/(f 2 ,Ii) = HU{t 2 ,h). 

dt2 

The proof of this equation is simple. Differentiating eqn 8.2 with 
respect to t 2 gives 


d ip{t 2 ) d U(t 2 ,ti) 

— = —as - W' 1 ’ 

but eqn 8.1 gives 

. di, (D) = _ HU(t 2 ,ti)ip(ti), 


(8.3) 

(8.4) 


U (£ 3 , h) 



Fig. 8.1 (a) The composition law: 

U{H,t 2 )tj{t 2) t 1 ) = t>(t 3 ,ii). (b) The 
inverse of a time-evolution operator is 
equivalent to reversing the direction of 
time: U(ti, t 2 ) = C/ _1 (f 2 , bl¬ 


and the result follows. This shows that the time-evolution operator 
itself obeys the Schrodinger equation. 

(4) U(ti,t 2 ) = U~^{t2,t\), 

so by taking the inverse of a time-evolution operator, one can turn 
back time [see Fig. 8.1(b)]. 

(5) UHt2,tl)U(t2,h) = l, 

i.e. the time-evolution operator is unitary. The proof is short. 
When t 2 = ti, the result is trivial. If we vary t 2 away from t \, 
then U^(t 2 ,ti)U(t 2 ,ti) could change, but since 3 


_d_ 
d t 2 




fw 

clt 2 


d U 

dt-2 


U^HU U^HU 

-:- 1 -:- 

l l 


0, (8.5) 


Q 

Note that in deriving eqn 8.5 we are 
using property 3 of the time evolution 
operator in the form 

dU _ HU df/t _ U^H 

dt i ’ d t i 


it won’t. QED. One consequence of this result is that U'(t 2 ,ti) = 
# -1 (t 2 ,ti), that is, the adjoint is the inverse. 

Property number 3 allows us to write an explicit expression for U(t 2 ,ti), 
since integrating this equation gives 4 

U(t 2 ,h) = e - ii4(t2 - tl) . (8.7) 


4 This exponential expression for an op¬ 
erator is actually shorthand for an ex¬ 
pansion: 

e A = 1 +A+ — AA+ — AAA+... (8.6) 
2! 3! 

We will meet many such exponential 
expressions. 
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8.2 The Heisenberg picture 


The Schrodinger picture puts all the time-dependence into the wave 
functions, but is that the best way to look at things? Consider the 
expectation value of an operator 

(d(t)) = (m\o\m)- ( 8 . 8 ) 

This tells us the average value that we’ll get if we make a measurement 
of the quantity corresponding to the operator O at a time t. However we 
construct our quantum mechanics, we must end up with this equation 
for expectation values to guarantee that we get the same prediction for 
the result of any measurement. 5 

We can also formulate things as follows by only worrying about the 
wave function at t = 0. After all, if we know ip( 0), we can always 
calculate ip(t) using 

= U(t, 0)^(0) = e~ i6t ip(0). (8.9) 

Applying this to the expectation value in eqn 8.8, we can write 

m)\om)) = (m\uHt,o) o o)>. (s.io) 

We can interpret this equation in two ways. In the Schrodinger picture 
we think of time-independent operators Os = O and time-dependent 
wave function states IV’s(O) — U{t, 0)|V’(0)), so that 

(il>( 0 )\u\t, 0 ) [o] u(t, 0 )|^( 0 )) = (Mt)\o s \Mt)), ( 8 . 11 ) 

where the ‘S’ subscripts stand for ‘Schrodinger picture’. To work out how 
the states evolve with time, we use the Schrodinger equation, eqn 8.1. 
However, we can also move the square brackets in eqn 8.11, so that it 
reads 

(^( 0)1 [u\t, 0 )OU(t, 0 )] 1 ^( 0 )) = (^hIOhWIV’h), ( 8 . 12 ) 

Werner Heisenberg (1901-1976) where the ‘H’ stands for the Heisenberg picture. We have made the 

wave functions = ^(0) time-independent (essentially by freezing the 
wave functions at their t = 0 values) and all the time dependence is now 
formally located in the operators On(t) which are given by 

du(t) = U\t,0)d s U(t,0), (8.13) 


^ Another example of a quantity we 
want to agree on is the amplitude for 
a system being in state \(f>(t 2 )) at a 
time t, 2 , having been prepared in a state 
at an earlier time t±, which is 
given by (</>(t2)|'*/ , (^i))- (Strictly we 
should only insist on the probability be¬ 
ing the same, so we could allow a dif¬ 
ference in phase.) 


^ Note that we are once again using 
property 3 of the time evolution opera¬ 
tor in the form 

d U _ HU d*yt _ JJ^H 

dt i ’ d t i 

We do this using eqn 8.13 and the fact 
that U and H commute. 


a situation which is analogous to the dynamical variables in classical me¬ 
chanics. The time dependence of Oa(t) is simply given by differentiating 
eqn 8.13, so that 6 

^ O s U + tfds^r = l(-&Hd s U + U^O s HU), (8.14) 
dt dt dt 1 

and hence we arrive 7 at the result 


d Quit) 
dt 




which is known as the Heisenberg equation of motion. 


(8.15) 
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8.3 The death of single-particle quantum 
mechanics 


The time-evolution operator gives us enough ammunition to kill off 
single-particle quantum mechanics forever. We ask the question: ‘what 
is the amplitude for a particle to travel outside its forward light-cone?’ 
(see Fig. 8.2), or in other words: ‘what is the amplitude for a particle 
at the origin in spacetime (x — 0, t = 0) to travel to position x at time 
t which would require it to travel faster than light?’ Faster than light 
means \x\ > t and the interval (x — 0, t — 0) is space-like. 8 If the ampli¬ 
tude is nonzero then there will be a nonzero probability for a particle to 
be found outside its forward light-cone. This is unacceptable and would 
spell the death of quantum theory as we’ve known it so far. 

To answer the question we are going to evaluate the expression 

A = <£c|e — |a; = 0), (8.16) 


for the space-like interval. We’ll work with a basis of momentum states 
H\p ) = E p \p) where the particle has a relativistic dispersion E p = 
\Jp 2 + m 2 . We’ll also need the transformation between position and 
momentum bases: 9 ( x\p) = ^ 9 ^ 3/2 e lp x . 

We start by inserting a resolution of the identity 10 


{x e 


—i Ht | 


x = 0) = / d 6 p(x |e 1 \p)(p\x = 0) 


J d 3 p(x\p)e lEpt 


(2tt) 3 / 2 


a-'P'O 


d 3 p 


(2tt) : 


_gi px e -iEpt 


(8.17) 


This has been fairly straightforward so far. Now we need to do the 
integral. 


Example 8.1 

The technique for doing an integral like this is very useful in quantum field theory, 
so we’ll go through it in detail. We start by converting to spherical polars 

.4 = d<j> ^*L|p| 2 f 1 d(cos0)e i l p l! !E l cose e- i£; P t 

Jo Jo (2 tt j- 3 y_! 

= /' 00 d|p|| P |(e i l»*ll«l-e- i l» > ll«l)e- ii5 P t 

(2? t) 2 i\x\ Jq \ / 

_• r oo j ---— 

= d|p| |p|e i lPll a ’le- i *VlPl 2 +- 2 . ( 8 . 18 ) 

yZTTJ \X\ 7—00 

To do this integral we need to resort to complex analysis. 11 The integration path 
is shown in Fig. 8.3(a). There are cuts on the imaginary axis starting at ±im and 
heading off to =too. We can deform the contour into a large semicircle in the upper 
half-plane that goes down the upper cut on the left side and up the cut on the right 
[Fig. 8.3(b)]. 


^This means that \x\ > ct in unnatural 
units. Our convention is that if x = 
(t,x) then x 2 = t 2 - (x 1 ) 2 - (x 2 ) 2 - 
(a; 3 ) 2 and so the interval is space-like if 
x 2 < 0. 


t 

i 



Fig. 8.2 A particle shouldn’t be ob¬ 
served outside its forward light-cone. If 
it starts at the origin, it should only be 
able to make time-like journeys. 

^This assumes a particular normaliza¬ 
tion. For our current purpose, where all 
we will do is to show a particular quan¬ 
tity is nonzero, the value of the normal¬ 
ization constant is irrelevant. 

1( \e. since we have a complete set of 
states then 1 = f d 3 p|p)(p|. 


See Appendix B for a primer. 
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Fig. 8.3 The contour integration. 

12 

The Compton wavelength A of a par¬ 
ticle is given by A = h/mc, i.e. h/m in 
units where c = 1. 


The term e 1 l p H a; l decreases exponentially as we head off to large imaginary values 
of |p| because we have \x\ >0. The term e — p\ 2 +m 2 a j so decreases exponentially 
as you head up the left side of the cut but increases as you go up on the right side. 
(This is because Im ^\/|p| 2 + m 2 ^ < 0 on the left side and Im ^\/|p| 2 + m 2 ^ > 0 
on the right side.) The increase on the right side isn’t a problem since it’s multiplied 
by e 1 l p H a; l and for our condition that \x\ > t the product e _1 l p l \ x \ e - lt V\p\ 2 + rn2 
decreases. Jordan’s lemma therefore tells us that as we make the half-circle large 
it doesn’t contribute to the integral, therefore, and we only have to worry about 
the part going up and down the cut. This allows us to deform the integral further 
[Fig. 8.3(c)]. We now substitute \p\ = iz and the integral becomes 


A 


(2rr) 


f°° d(iz)ize~ zM ( e t'S* 
Z \x\ Jm \ 


— m^ — ty/z ^ —r 


2tv 2 \x\ 


— m\x | 


n oo , . 

/ d 2 ze- (z “ m)|a!| sinh (tVz 2 - m 2 ) . 

J m ' / 


(8.19) 


The integrand is positive, definite and the integral is nonzero. This means that A \ 2 
yields a nonzero number, heralding doom for the single-particle picture. 


We’ve discovered that the probability of finding a particle outside its 
forward light-cone is proportional to e~ m l x L Although this is small for 
large \x\ it’s certainly unacceptable that it exists at all. We are unable 
to reconcile single-particle quantum mechanics and special relativity. 
Single-particle quantum mechanics is dead! 

There is another way of understanding why single-particle quantum 
mechanics fails when faced with special relativity. Imagine squeezing 
a particle of mass m into a box with side smaller than the particle’s 
Compton wavelength. 12 The uncertainty in position Ax < A so A p 
h/X = m and so the energy of the particle is now much greater than m. 
This is enough, according to special relativity, to lead to pair production 
(i.e. the creation of particle-antiparticle pairs). Thus, on average, the 
box has more than one particle in it. Single-particle quantum mechanics 
can never describe this situation, by definition! 


8.4 Old quantum theory is dead; long live 
fields! 


13 

The same sentiment was also ex¬ 
pressed by Hippocrates and by Shake¬ 
speare (in Hamlet). 


^The Andromeda galaxy is some 2.5 
million light-years from Earth. 


A desperate disease requires a dangerous remedy 
Guy Fawkes (1570-1606) 13 

What are we to do instead? We resort to fields. In classical mechanics 
we assume that an observer has access to all possible measurements of 
a system. Special relativity teaches us that no one person can measure 
everything. If information would have to travel faster than c for a mea¬ 
surement to be made it is impossible to make such a measurement. In 
fact, you make a measurement at a single point in spacetime where you, 
yourself, are localized. You can’t then make a measurement at a point 
that has a space-like separation from you. More simply, an observer can’t 
make a measurements here on Earth and then make a measurement of an 
event in Andromeda 14 that occurred one second later. The Andromeda 
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information would have to travel to the Earthbound observer faster than 
c, which is forbidden by special relativity. 

Have we seen this limitation elsewhere in physics? Absolutely. In 
electromagnetism we calculate and measure the value of an electric or 
magnetic vector at a spacetime point x. We call these E(x) and B(x) 
respectively and they are examples of fields. They are machines that take 
spacetime coordinates x M = (t,x) and output the value of the electric 
or magnetic vector at that point. The field machines are defined locally, 
and so they only tell us about the electromagnetic vectors at the point 
x. We know that we can’t design an experiment that measures E(x) for 
x = (now, here) and for x = (now, Andromeda ). 15 

We need to fix up this idea of a locally defined field to make it quantum 
mechanical. In quantum mechanics we have operators that correspond 
to observables. It seems, therefore, that we will need operator-valued 
fields if we’re to reconcile quantum mechanics and special relativity. 
We know two things about these field operators already: (i) crucially, 
they must not allow information to travel faster than c; (ii) they must 
be Hermitian if they are to correspond to observables. 16 We can state 
the former property quantum mechanically. If we say that the operator 
0\ corresponds to an observable measured at spacetime point x and 
O 2 corresponds to an observable measured at y, then if x and y have a 
space-like separation [(x — y) 2 < 0] then the operators must commute: 

[d(x),d(y)] = 0 , ( 8 . 20 ) 

that is, the result of one measurement won’t affect the result of the other. 
How could they if the separation between the experiments is space-like? 

To make quantum fields we need to find operators that are defined at 
positions in spacetime. We’ll call these <j>{x). We hope that we can build 
everything from these locally defined fields. 17 In the next few chapters 
we will continue our search for these objects. 


Chapter summary 

• The time-evolution operator is given by U(t 2 ,t\) = e _lff ( t2_il ), 
and in the Schrodinger picture acts to time-evolve states. The 
operators Os are time-independent. 

• In the Heisenberg picture states don’t evolve in time, but the oper¬ 

ators do: On(f) = U\t, 0) O s U(t, 0). Operators obey the Heisen¬ 
berg equation of motion = ^[On(t),H]. 

• Squeezing a particle of mass m into a region smaller that its 
Compton wavelength h/m can lead to pair production, invalidating 
single-particle quantum mechanics. 

• To fix this, we need operator-valued fields. 


15 AP 

Of course, if we solve some equa¬ 
tions of motion for the fields we may 
be able to work out what’s happening 
elsewhere. 


1 It is not essential that a general field 
should necessarily correspond to an ob¬ 
servable. Indeed we will meet many ex¬ 
amples where this is not the case. For 
electric and magnetic fields we do seek 
Hermitian fields. 


17 

This turns out to be the case! 
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Exercises 


(8.1) Show that the form of the time evolution operator 
U(t 2 ,ti) = exp[— \H(t 2 — ti)] (as given in eqn 8.7) 
exhibits properties 1-5 in Section 8.1. 

(8.2) For the Hamiltonian 

H = ^2 E kd\dk, ( 8 . 21 ) 

k 

use the Heisenberg equation of motion to show that 
the time dependence of the operator a\ is given by 

a{(t)=a{(0)e iEkt/h , (8.22) 


(8.3) For the Hamiltonian in the previous problem find 
an expression for time-dependence of the operator 

X — XlmClgCim. 

(8.4) A spin-1 particle is in a magnetic held aligned along 
the y-direction. The Hamiltonian can be written as 
H = uiSy where u> is a constant (proportional to 
the magnetic held). Use the Heisenberg equation of 
motion to show that 


df 

dS| 

df 


-wSh, 

(8.23) 

W'S'hj 

(8.24) 


and find a similar expression for &k(t). 


and give a physical interpretation of this result. 



Quantum mechanical 
transformations 



When Gregor Samsar awoke one morning from uneasy 
dreams he found himself transformed in his bed into a gi¬ 
ant insect. 

Frank Kafka (1883-1924), The Metamorphosis 

Quantum mechanical states transform (though usually less dramatically 
than the transformation that affected Gregor Samsar) when you trans¬ 
late or rotate coordinates. In this chapter we are going to look at how 
quantum states, operators and quantum fields will transform when sub¬ 
jected to translations, rotations or Lorentz boosts. Where do we start? 
One thing we certainly know how to do is transform coordinates in 
spacetime. For example, we can translate a position three-vector cc by a 
three-vector a, which is achieved simply by adding a to the vector. To 
put things formally, we act on x with a translation operator T(a) and 
transform it into x' given by 

x' = T (a)x = x + a. (9-1) 

We also know how to rotate a three-vector through an angle 9 about the 
z-axis, using 

/ cos 9 — sin 9 0 \ / x 1 \ 

x 1 = R(9)x= I sin# cos 9 0 1 [ x 2 1 . (9.2) 

\ 0 0 1 J \ x 3 J 

We need to find operators that can transform quantum states, which 
live in Hilbert space (rather than the Euclidean space of three-vectors 
or Minkowski space of four-vectors). Here we investigate what form 
these operators take. 
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Remark about notation: 

We will be careful to distinguish be¬ 
tween, on one hand, operators T(a) 
and R(0) which act on single vectors 
(like cc) and, on the other hand, quan¬ 
tum mechanical transformation oper¬ 
ators like U (a) and U(0) which act 
on quantum states (like |sc)) and also 
(as we shall see) on operators (like 
cc) and on quantum fields (like </>(#)). 
These quantum mechanical transfor¬ 
mation operators will turn out to be 
unitary, so we shall use the symbol U 
for all of them. 


9.1 Translations in spacetime 

Let’s start with looking at the properties of single-particle quantum 
systems when we translate them in space. Suppose we’re examining a 
particle localized at a coordinate and we want to translate it to another 
point in space. We need a quantum mechanical operator U that takes a 
state localized at x and transforms it to a position x+a. The translation 
operator is written as 


U(a)\x) = \x + a). 


(9.3) 
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lr The transformation in eqn 9.2 is a 
right-handed, active rotation or a left- 
handed, passive rotation. 



Fig. 9.1 (a) A passive rotation acts on 
the coordinates, (b) An active rotation 
acts on the object. 

2 A unitary matrix is defined as one for 
which U* = E/ -1 . Note that some 
transformations in quantum mechanics 
must be performed by an antiunitary 
operator as discussed in Chapter 15. 

^A group is a set G together with 
an operation • which connects two el¬ 
ements of the set (let’s say a, b G G) 
and produces another element of the set 
(written a • b ). This has to satisfy four 
conditions: 

(1) Va, 6 G G, a • b G G (closure). 
The symbol V means ‘for all’. 
The first property written in full 
is ‘For all members of the group 
a and b, the combination a • b is 
also a member of the group’. 

(2) Va, b, c G G, a® (6« c) = (a* b) • c 
(associativity). 

(3) Be G G such that Va G G, ame = 
e • a = a (identity). 

(4) Va G G, 3a —1 G G such that a • 
a -1 = a -1 • a = e (inverse). 

Note that one of the conditions was 
not that a • b = b • a. If that condi¬ 
tion (commutativity) holds, then one 
has a special type of group called an 
abelian group, but many groups (in¬ 
cluding ones which are important in 
quantum field theory) are non-abelian. 

^Named after the Norwegian mathe¬ 
matician Sophus Lie (1842-1899). 
Note that Lie is pronounced ‘lee’. 


We will treat this as an active transformation where we move a particle 
to a new position (instead of changing the coordinates under the particle, 
which is known as a passive transformation) 1 (see Fig. 9.1). 

Next we examine the properties we want in a translation operator. 
Translating the particle shouldn’t change the probability density. Thus 

(ip(x)\ip(x)) = {ip(x + a)\il>(x + a)} = {ip(x)\U^(a)U(a)\ip{x)), (9.4) 

which means that we may write 

U\a)U(a) = 1, (9.5) 

that is, the translation operator is unitary . 2 

We know that a translation through a distance a followed by one 
through a distance b can be represented by a single translation through 
a distance a + b. We want our operators to have this property too, so 
we add to our wish list 

U{a)U(b) = U(a + b). (9.6) 

We should also have the trivial property that the transformation can do 
nothing to the particle, that is 17(0) = 1. 

To recap then, we have identified three rules for our translation oper¬ 
ator: 

• U(a)U^(a) = 1 (17 is unitary), 

• U{a)U(b) = U{a + b) (composition rule), 

• 17(0) = 1 (a zero translation does nothing). 

Notice that these rules are telling us that our transformations should 
form a group. 3 It is a rather special group because each element depends 
on a vector a, and thus the group is continuous, differentiable and has 
an infinite number of elements. Such a group is called a Lie group . 4 


Example 9.1 

Let’s check that U(a) has the required conditions of a group. A group has four 
properties: the existence of an identity [E/(0) = 1], closure (we can make any element 
of the group by multiplying two other elements of the group), associativity [U(a + 
b)U(c) = U(a)U(b-\- c)] and the existence of an inverse for each element of the 
group. We also know that the inverse of a translation T~ 1 (a), is just a translation 
through —a. Putting this together with the unitarity of the operator, we can write 
that U~ 1 (a) = C/t (a) = E/(—a). 

Let’s now try out the new operator. It’s supposed to move a state though a and 
we can use the position operator x to check where it is: 

17(a) | ®) = |a5 + a) 

xU(a)\x) = x\x + a) = (* + a)\x + a) 

U^(a) x U(a)\x) = (tc + a)U\a)\x + a) = (* + a)\x), (9-7) 

or comparing the effects of operators on the state |aj), we see that 

E/t (a) x U(a) = (® + a). (9-8) 

That is, the space translation operator can be thought of as a transformation of an 
operator like x instead of a transformation of a wave function like |as). 
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Example 9.2 

Often translating will have no effect on a property that we determine using an oper¬ 
ator O. In that case the property probed by that operator is an invariant. 5 In that 
case we would have 

(i/j(x)\d\ip(x)) = {i/)(x)\U~ 1 (a)dU(a)\ip(x)). (9.9) 

Pulling out the operators we see that the condition for an invariant is 

U~ 1 (a)OU(a) = O. (9.10) 

Act on the left with U and thus invariance implies that OU = UO, or [0,17] = 0. 
Recall from eqn 8.15 that operators that commute with the Hamiltonian operator H 
represent constants of the motion, i.e. quantities that are conserved. 


Can we come up with an explicit expression for a translation operator 
that acts on quantum states? We will do this for the special case of a 
position wave function il>(x) = (x\p) (the result will turn out to also allow 
us to transform general quantum mechanical states). We increment the 
point at which we evaluate ip(x) by an infinitesimal amount Sa in the 
^-direction, for which we may write 

ip(x + Sa) = ip(x) + g a -|-_ (9.11) 

ax 

Remembering that the momentum operator p = — i^L, we have 

ij){x + Sa) = (1 + ipSa) 4>(x). (9-12) 

We say that the momentum operator p is the generator for the space 
translation. To make a translation through a distance a we can translate 
through Sa a large number N times, such that 

tf>(x + a) = lim (1 + i pSa) N ip(x) 

N—*oo 

= e i ^°ijj(x). (9.13) 

This gives us a space evolution operator. However, this is not quite the 
U(a) that we’re trying to find! We want to translate the entire quantum 
state through a distance a not study how it evolves over a distance a! 
The translation operator is, therefore, U(a) = e~' p a . 


Example 9.3 

Acting on a momentum state with our operator we obtain 

U(a)\q) = 

= e i "" q). (9.14) 

Projecting along the coordinate direction gives us a translated wave function 


(as|?7(a)|q) = { X \q)e~^ a = ^ q<x ~ a) • 


(9.15) 


^ See Chapter 10. 
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^ It might be helpful to view time evo¬ 
lution as passive : one studies how ^ 
changes as coordinate t increases. Time 
translation is then active : one moves a 
quantum state through time. 


Remark about notation: 

Spacetime translations, rotations, 
Lorentz boosts and combinations of 
the above are all elements of the 
Poincare group. They can all be 
represented by unitary operators. To 
keep ourselves from an overload of 
brackets and indices, we will simplify 
the notation by denoting all these 
unitary operators by U and let the 
operator’s argument indicate whether 
we are dealing with a translation 17(a), 
rotation U(0) or Lorentz boost U(<p). 
This should not cause confusion as the 
context will make it clear which type 
of transformation we are dealing with. 


Previously we had a time-evolution operator Ufati) = e~ lH ^ t2 ~ fl >. We 
can see how this relates to the argument in the previous section. Let’s 
evolve a system through a time 6t a . We obtain 

^(t + St a ) = + ^jp-Sta + • • • (9.16) 

at 

Now we need to remember that H = i4i, which gives us 

ip{t + St a ) = (l - iHSta') (9-17) 

Remembering the change of sign required to turn this from a time- 
evolution operator 6 into a time translation operator we easily obtain 

U(t a )=e i ^. (9.18) 

We can put the space and time translations together and define a space- 
time translation operator U(a) = e ip '“ = e 1 ' f/ta_lpa , where we choose 
the definition of the four-momentum operator to be p = (if, p). We will 
return to the consequences of this exponential form for the expression 
for U(a) later in the chapter. 

9.2 Rotations 

Next we ask how to rotate objects. A rotation matrix R(0) acts on a 
vector quantity, such as the momentum of a particle as follows: p' = 
R (9)p. We specify the rotation as R(0), where the direction of the 
vector 6 is the axis of rotation and its magnitude is the angle. For 
rotations of quantum states, we propose an operator 

\p') = U(G)\p) = \tl(6)p). (9.19) 

As before, we require that U(0) has: (i) unitarity, (ii) an identity element 
and (iii) a composition rule. What we’re looking for then is that 

• U^G)U(G) = 1, 

• U(0) = 1 , 

. 11 ( 9 ^ 11 ( 62 ) = U( 612 ) where R(0 12 ) = R(6>i)R(0 2 ), 

and, just as before, our operators form a Lie group, called the rotation 
group. 


Example 9.4 

We can prove that the operator defined this way is unitary 


U(8)U\0) 


U(8) ^ d 3 p|p}(pj) U\8) 
f d 3 p|R(0)p>(R(0)p| 


(9.20) 
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and since p' = R (0)p and d 3 p' = d 3 p we have 

f d 3 p |R(0)p)(R(6)p| = J dV Ip'Xp'I = 1, (9-21) 

as required. Notice that our proof rests on the fact that d 3 p' = d 3 p, which is true 
since det R(0) = 1 for a proper rotation and so the Jacobian is unity. 


For the translation case we had 

(a) xU(a) = x + a, (9.22) 

and we find 7 an analogous expression for the rotations, i.e. 

U\Q)pU(6) = H(6)p. (9.23) 

Thus the momentum operator is transformed in just the same way as 
one would rotate a momentum vector. 

Finally, we may find an explicit expression for the rotation operator 
that acts on wave functions. Let’s examine a rotation of a wave function 
about the 2 -axis and expand 

ip(0 z + 50 z ) = V>(6> z ) + ' ) S6 Z + ... (9.24) 

Here we recall that J z = —i^4^, from which we see that rotations are 
generated by the angular momentum operator. We obtain 

ip(0 + 5d z ) = (l + iJW) ip(6 z ), (9.25) 

and finally, repeating N —* oo times and changing the sign gives U ( 9 Z ) = 
e~ lJ s , and for a rotation about an arbitrary axis we have 

17(0) = e~ i3e . (9.26) 


9.3 Representations of transformations 

We have seen how to transform spacetime coordinates by translations 
and rotations. But spacetime is an empty coordinate system without 
filling it with quantum fields. So we now have to ask how the quantum 
fields themselves transform. A quantum field 4>(x) takes a position in 
spacetime and returns an operator whose eigenvalues can be a scalar, 
a vector (the W ± and Z° particles are described by vector fields), a 
spinor (the object that describes a spin-^ particle such as an electron), 
or a tensor. A spatial rotation simply rotates the coordinates for a scalar 
field [Fig. 9.2(a)] but for a vector field simply rotating the coordinates is 
not enough [Fig. 9.2(b)], you also have to rotate the vectors [Fig. 9.2(c)]. 

For this reason we need to think a bit more deeply about how trans¬ 
formations are represented, and for concreteness let us keep focussing 


7 

The proof for this goes through much 
like the version in Example 9.1 and is 
left as an exercise. 



Fig. 9.2 (a) A spatial rotation sim¬ 
ply rotates the coordinates of the scalar 
fields but does not alter the value of the 
scalar, (b) Simply rotating the coordi¬ 
nates of a vector field is not a correct 
transformation since (c) you have to ro¬ 
tate the vectors at each point in space 
as well. 
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representation of a group D is 

obtained by mapping each element gi 
of the group G into a continuous lin¬ 
ear operator that acts on some vec¬ 
tor space. This mapping preserves the 
group product, so that if <7i • Q 2 = Q3, 
then D(gi)D(g 2 ) = D(g 3 ). 


^ This makes perfect sense because a 
scalar can have no angular momentum. 


The representation for a rotation of 
a field with angular momentum quan¬ 
tum number j is frequently given the 
symbol ( 6 ). Thus in this example, 
part (a) describes (0) = 1, part 
(b) describes and part (c) 

describes a representation related to 
D^(0) which uses the Cartesian axes 
cc, y and z as the basis of the spatial 
components, rather than the three az¬ 
imuthal quantum numbers. The gen¬ 
eral formulae for the matrix elements 
of D«)(0) are 

(Db)(0)] mm , = (jm'\U(8)\jm) 

We can then use the standard relations 
for angular momentum operators: 

j z \jm) = m\jm), 

J ± | jm) = y(j=Fm)(j + l±m) \j m± 1), 
J ± = J X ± i J y . 

1 This means that there are elements of 
the Lie group which are arbitrarily close 
to the identity and so an infinitesimal 
group element can be written 

g(a) = 1 + i a i T i + 0(a 2 ), 

where T x are the generators of the 
group. The Lie algebra is expressed by 
the commutator 



where are called structure con¬ 
stants. For rotations T 1 = J % and 
pjk _ £ ijk' 


on rotations. Any rotation R(0) can be represented by a square matrix 
D{6). This takes a very similar form to U(0) = e~ iJ 6 , namely 

D(0) = e~ ue . (9.27) 


In this equation, J is a square matrix, a representation 8 of the operator 
J. Equation 9.27 can be rewritten as 


1 dD{9 i ) 

T W 


6 i =o 


(9.28) 


means that 


J z = -- 


= 0. 


(9.29) 


Example 9.5 

(a) Consider rotations about the z-axis. A trivial representation is D(0 Z ) = 1. This 

1 dD(9 z ) 

99 z 

By extension J x = J v = 0. This representation is appropriate for a scalar field. 9 

(b) It turns out that the representation of a rotation about the z-axis for a spin- ( 
particle is 

' e - ifl2 /2 o 
0 e iS2 / 2 


and hence 


J z = - 


D(9 Z ) = 

1 9D(9 Z ) 


96 z 


= l( 

' 1 

0 ^ 

e z =o 2 ' 

0 

-1 ) 


(9.30) 

(9.31) 


which we recognise as the operator for the z-component of angular momentum for 
a spin-1 particle. This representation of the rotation group will be useful for the 
spinor fields introduced in Chapter 37. 

(c) For rotations about the a;-axis we have 

/ 0 0 0 

0 

cos 9 X 
sin 9 X 


ll(0 x ) = 


0 1 
0 0 
\ 0 0 


0 

0 

— sin 9 Z 
cos 9 X 


\ 


(9.32) 


! 


j x = —- 


so we obtain 


Repeating for the y- and z-axes yields 


jy = i 




( 

0 

0 

0 

0 

\ 

1 9R(9 X ) 



0 

0 

0 

0 

i 96 x 

— 1 

e x =o 


0 

0 

0 

-1 



K 

0 

0 

1 

0 
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( 0 

0 

0 
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0 

0 

0 

0 

\ 

0 

0 

0 

1 

J z = i 


0 

0 

-1 
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0 

0 

0 
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0 

1 
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V 0 

-1 
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0 

0 
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(9.33) 


(9.34) 


The important point about these representations is that they all share 
the same underlying algebraic structure as the rotation operator. This 
algebra is called a Lie algebra, and can turn up whenever you have a 
continuous group. 10 The Lie algebra is encoded within the commutation 
relations of the generators. In other words, whether a generator of rota¬ 
tions rotates abstract quantum states in Hilbert space (like J z = — igfj-), 
real space vectors (as in the previous example) or complex spinors (as in 
Chapter 37) each set of generators should have the same commutation 
relations: \J\J 3 } =ie ijk J k . 
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9.4 Transformations of quantum fields 

We have seen that a transformation associated with a unitary operator 
U will turn a locally-defined operator O (such as the position operator) 
into &OU ( see eqn 9.8). Now we need to determine how such transfor¬ 
mations affect quantum fields. We’ll start by examining a scalar field: 
an operator-valued field whose matrix elements are scalars. 


Example 9.6 

If we translate both a state and an operator by the same vector distance a then 
nothing should change. In equations 

(y\cj>{x)\y) = {y + a\<j>(x + a)\y + a). (9.35) 

Since \y + a) = U(a)\y) we must have 

U(a)<j>(x)U^ (a) = 4>(x + a). (9.36) 

Notice that this looks like U and U' have been bolted on the wrong way round when 
compared to the expression 

U\a)xU(a) = {x + a), (9.37) 

but these two expressions are talking about different things. The position operator 
x tells us where a particle is localized, the field operator (j>{x) acts on a localized 
particle at x. There’s no reason that they should transform in the same way. 


The previous example demonstrates that there are two ways of thinking 
about translations operators, and we shouldn’t confuse them. They can 
be pictured: 

• As acting on states: U(a)\x) = \x + a) moves a locally defined 
state from being localized at x to being localized at x + a. 

• As acting on locally defined operators: 


(a) 





>=1 




U\a)<j>{x)U (a) = (j>(x — a). (9.38) 

These are not the same as we can see from Fig. 9.3. We can imagine 
a field operator <f>(x) as a sniper’s rifle shooting a state at a position x 
[Fig. 9.3(a)]. If we translate the state from |x) to |x + a), 4>(x) misses the 
state because we’ve moved the state [Fig. 9.3(b)]. If, on the other hand, 
we translate the operator from 4>(x) to <j>(x — a), we miss again, but this 
time because we’ve changed the position we’re aiming at [Fig. 9.3(c)]. 
Since we’re interested in the fields, changing the operator will be the 
most useful procedure and so the equation to focus on is eqn 9.38. 


Fig. 9.3 The field operator <fi(x) act¬ 
ing on a localized state is pictured as a 
sniper’s bullet shooting a duck located 
at position x. (a) Dead on target, the 
state of a dead duck has perfect overlap 
with the state produced by acting (f)(x) 
on the live duck at x. (b) Translating 
the state results in a miss, as does (c) 
translating the operator. 


Example 9.7 

The creation and annihilation operators are also actually quantum field operators, 
albeit in momentum space. That is, the operator takes a momentum m and 
outputs an operator that creates a particle at that position in momentum space. 
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We’ll examine what we get if we transform according to C/"l’(a)aJ n 0’(a), which is 
a translation in spacetime by a vector a. We can try out this translated operator on 
a momentum eigenstate | q) (assumed different to m): 


l«) = 


= e ip 

“aLe"*“|q> 


= e ip 

“<4 Me-*- 


= e ip 

°|m, q)e~ iq a 


= I'm. 

Q y(rr,+q)o. e -iqa 


= |m. 

, q)e irn a . 

(9.39) 

creates 

a state with momentum m. 

but the 


result of the transformation is an additional phase e irn a . We conclude that 


UHa)al n U(a) = e ima al n . 


(9.40) 


This discussion has been for scalar fields, but we must remember that 
fields come in many forms, including vector fields as discussed in the 
following example. 




Fig. 9.4 (a) A classical scalar field has 
a maximum at some point in space re¬ 
moved from the origin by a vector d. 
(b) Rotating the distribution with an 
operator R (6) moves the maximum to 
a position R(0)d. (c) A classical vec¬ 
tor field also has a maximum value at d. 
(d) The transformed field has a maxi¬ 
mum at R (0)d and this vector is ro¬ 
tated to R(0)V ma x- 


Example 9.8 

A classical scalar field </>(*) (like the distribution of temperature across a metal block) 
has a large maximum <f)(d) = </> max at some point in space removed from the origin 
by a vector d. Rotating the distribution with an operator R (6) moves the maximum 
to a position R (Q)d. This is embodied in the operator equation W (0)<fi(x)U(0) = 
</>(R -1 (0)x), which demonstrates that we need to enter the position x = R (0)d in 
the transformed field to find the maximum [see Fig. 9.4(a) and (b)]. 

For a classical vector field V, such as the velocity distribution in a liquid, let’s 
imagine that we have a maximum velocity V(d) = V ma x. Being a vector, this has 
a direction. A rotation R (6) now moves the position of the maximum and, also 
its direction. The analogous mathematical description is now U* (0)V(x)U(0) = 
R(0)VfR -1 (0)x), demonstrating that the transformed field has a maximum at 
R (0)d and that this vector is rotated to R(0)Vm ax [see Fig. 9.4(c) and (d)]. 


It is therefore important to note that if we transform a field in some 
way, we generally change the point at which the field is evaluated and 
also its polarization. We can write a more general rotation of a general 
field operator 3>(x) as 

ffl{0)*{x)U{p) = D(0)<&(R~ 1 (9)x), (9.41) 


where D{6) is the appropriate representation of the rotation operator, 
as discussed in Section 9.3 (e.g. for scalar fields D{6) = 1). 


As we are now explicitly considering 
spacetime, we will now switch from la¬ 
belling our spatial axes x, y and 2 : and 
go back to using 1, 2 and 3, with 0 for 
the time axis. 


9.5 Lorentz transformations 

A similar philosophy carries over to the case of Lorentz transformations 
of states and quantum fields. 11 Consider a boost of a four-vector in the 
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^-direction whose Lorentz transformation is given by x'^ = A (P 1 ) ,1 ^x l/ , 


where 


ACA 


( 7 1 PW 0 0 \ 

/?V 7 1 0 0 

0 0 10 

\ o o oi/ 


(9.42) 


This transformation connects two inertial frames moving with relative 
speed v = cp 1 along x. Using the substitutions j l = cosh^*, 7 i P l = 

where p l is called the rapidity, this matrix 


sinh (p and tanh^ = P l 
becomes 

A(^) = 


/ 


cosh p 1 
sinh cjr 
0 
0 


sinh ^ 1 
cosh p 1 
0 
0 


\ 


0 
0 
0 
1 J 


(9.43) 


We write an operator that acts in Hilbert space : 12 U(p) \p) = |A (<j>)p). 
We also want a generalized matrix form like we had with the rotation 
e -ij e F r om our experience with rotations we suppose that we can write 
a generalized Lorentz transformation matrix as 13 


W) = e iK *, 


(9.44) 


12 

We will not pursue this operator fur¬ 
ther here. Unlike linear and angular 
momentum the boost three-vector is 
not conserved, so we do not use its 
eigenvalues to label states. 

13 

Note the plus sign in the exponent 
compared to the rotation case. 


and the generators of the Lorentz transformations are given by 


K l 


1 dDtf) 

i dp 1 


‘=o 


(9.45) 


As shown in the exercises, we can find the explicit forms for these gen¬ 
erators for the case of vectors by taking D(p l ) to be the A(</)*) matrices 
such as that in eqn 9.43 for a boost along x. We therefore require our 
quantum fields to Lorentz transform according to 


U*{4>)*(x)U(4>) = D(4>)&(A~ 1 ((f>)x). (9.46) 


Finally, we recall that the fundamentally important feature of the 
generators of a transformation is their commutation relations. Upon 
working out a set of generators for the Lorentz transformations, as you’re 
invited to in the exercises, it comes as a shock to find 

[K 1 , K 2 } = I^K 2 - A 2 AT 1 = -i J 3 . (9.47) 

In other words, the difference between (i) boosting along x and then 
boosting along y and (ii) boosting along y and then boosting along x is 
a rotation about z. Mathematically speaking, the fact that [K 1 , K 2 ] = 
— iJ 3 implies that the Lie algebra of the Lorentz transformations isn’t 
closed and their generators do not form a group. Let’s see what happens 
if we examine some other commutators. Pressing on, we find 

[J/A 1 ] = 

[A A 2 ] = 


0, 

iA 3 , 


(9.48) 

(9.49) 
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along with cyclic permutations, that is to say [J l , A' J ] = \e l ^ k K k . This 
implies that the boosts and rotations taken together form a closed Lie 
algebra and it is this larger group that is called the Lorentz group. 
We may write a general Lorentz transformation combining both boosts 
and rotations as 

D(6,4>) = (9.50) 

where, as always, the generators J and K are those appropriate for the 
object being transformed. If you also include the spacetime translations 
in the mix, you end up with an even larger group called the Poincare 
group. 


Chapter summary 

• A transformation is associated with a unitary operator U which 
acts on states via \x') = U\x). 

• A general spacetime translation has U{a) — e lj5 a . 

• A rotation has U(6) = e _1 ^ e . 

• A Lorentz transformation has U{<$>) = e lK ^. 

• Transformations can be represented by a matrix D(a, 6 , </>) appro¬ 
priate to the type of field. For scalar fields D = 1. 

• The quantum fields that we seek will have to have the following 
transformation properties: 

U*(a)4>(x)U(a) = D(a) - a), 
u\o)${x)u{d) = D(e)^(R~ 1 (e)x), 
U\<t))3>{x)U{<t>) = D{4>) 

i.e. they obey the translation, rotation and Lorentz transformation 
properties of a local operator. 


Exercises 

(9.1) Deduce that the generators of the translation oper- (9.2) Show that explicit forms of the generators of the 
ator are given by Lorentz group for four-vectors are 


1 dU(a) 


i da 


K 1 = 


(9.51) 




( 0 

i 

0 

0 \ 

1 dU^ 1 ) 


1 

0 

0 

0 

i dcj) 1 


0 

0 

0 

0 


(f> 1 = 0 

l o 

0 

0 

0 / 


(9.52) 
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and similarly 


K = -i 


and 


K = -i 


/ o 
0 
1 

V o 

/ o 
0 
0 

V i 


0 
0 
0 / 

0 

0 ) 


(9.53) 


(9.54) 


(9.3) Show that an infinitesimal boost by v 3 along the 
x-'-axis is given by the Lorentz transformation 


( 


A"* = 


v 

1 

0 

0 


v 

0 

1 

0 


\ 


(9.55) 


Show also that an infinitesimal rotation by 6 3 about 
x J is given by 


( 1 

0 

0 

0 \ 

0 

1 

e 3 

-6 2 

0 

-e 3 

i 

e 1 

l 0 

e 2 

- 8 1 

i / 


(9.56) 


Hence show that a general infinitesimal Lorentz 
transformation can be written x /(i = A'VaA where 
A = 1 + w where 


/On 1 v 2 v 3 \ 

v 1 0 9 3 —d 2 

v 2 -e 3 o e 1 
\ v 3 e 2 - e 1 o 


(9.57) 


Show that cu'"' = u> ll \g Xl ' and = g ^,are 
antisymmetric. An antisymmetric 4x4 matrix has 
six independent parameters. This makes sense as 
we are encoding three rotations and three boosts. 


Show that the relationships between 8 1 , v‘ and u/ J 
are 

= - ^e ijk u> jk and v* = iu 0i . (9.58) 

(9.4) A transformation of the Poincare group combines 
translations (by vector a M ), rotations and Lorentz 
boosts and can be written . An 

infinitesimal transformation of the Poincare group 
is thus x = x M + a M + to^x 1 ', where ai'V is given 
in the previous problem. Therefore a function f(x) 
transforms to 

f(x) = f(x + a + u}x) (9.59) 

= f(x) + a M 9 M /(x) + uj^x^d^fix). 

Use the antisymmetry of u>p V to write 

f(x) = [1 + - ^a> M „(x' 1 d ! ' - x"d I ')]/(x), 

(9.60) 

and hence 

}(x) = [1 - ia> M + ^u^MH/(x), (9.61) 

where = — ic^ and = — i(x M cU — x"<9 M ) are 
generators of the Poincare group. Note that 
is antisymmetric and is related to the generators J 
and K by 

ji = ( 9 . 62 ) 

so that J = (M 23 ,M 31 ,M 12 ), and 

K l = M°\ (9.63) 

so that K = (A/ 01 , M 02 , M 03 ). Also show that 

A = (9.64) 

with A = e-K-J e-K^). 
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1 Emmy Noether (1832-1935). Her 
name should be pronounced to rhyme 
with Goethe, Alberta and Norris 
McWhirter. 


2 

Consideration of local symmetries, 
where a transformation changes from 
point to point, will be postponed to a 
later chapter. 

Let’s get the terminology straight. A quantity is invariant when it takes 
the same value when subjected to some particular transformation, and is 
then said to exhibit a particular symmetry. A translationally invariant 
system looks the same however you translate it. A rotationally invariant 
system appears identical however it is rotated around a particular axis. 
Electric charge is said to be Lorentz invariant because it takes the same 
value when viewed in different inertial frames of reference. A system is 
said to possess a particular symmetry if a property is invariant under a 
particular transformation, so for example a cylinder possesses rotational 
symmetry about its axis. A quantity is said to be conserved when 
it takes the same value before and after a particular event. Thus in 
a collision between particles, the four-momentum is conserved within a 
frame of reference but it is not invariant between frames of reference. 

Having stressed that invariance and conservation are entirely different 
concepts, it is now time to point out that they are in fact connected! 
For example: 

• Conservation of linear momentum is related to invariance under 
spatial translations. 


The Universe is built on a plan the profound symmetry of 
which is somehow present in the inner structure of our intel¬ 
lect 

Paul Valery (1871 -1945) 

In the previous chapter we saw that translations were achieved with 
an operator that contains the momentum operator, and rotations were 
achieved using the angular momentum operator. A coincidence? Actu¬ 
ally not, and in fact we can go further. If a system possesses some kind 
of invariance (so that for example it is invariant to translations) then 
a particular quantity will be conserved. This idea is bound up with a 
result called Noether’s theorem, named after the great mathematician 
Emmy Noether. 1 This chapter will explore this theorem in detail. 

A point to note from the outset. In this chapter we will be using the 
word symmetry to mean global symmetry, i.e. a symmetry possessed 
by the entire system under consideration. Thus rotational symmetry 
refers to a symmetry associated with rotation of the coordinates applying 
to every point in our system. 2 

10.1 Invariance and conservation 
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• Conservation of angular momentum is related to invariance under 
rotations. 

• Conservation of energy is related to invariance under time trans¬ 
lations. 

The general idea is clear: invariances lead to conservation laws. 


Example 10.1 


(i) The Euler-Lagrange equation (eqn 1.28) can be written as 

d L _ d_ f dL\ 
dx^ d t \die) 


( 10 . 1 ) 


and using the canonical momentum p^ = dL/dx^ (eqn 5.3), we can write 
eqn 10.1 as 

This has the immediate consequence that if L does not depend on then 
dL/dx p = 0 and hence p^ is a constant. In other words if the Lagrangian 
is invariant under a particular component of spacetime translation, the corre¬ 
sponding component of four-momentum is a conserved quantity. 

(ii) Another example of this follows from eqn 5.58 of Exercise 5.1 which showed 
“““ Si dH 

«--dT “"- 31 

Thus if L does not depend explicitly on time, then the Hamiltonian H (i.e. 
the energy) is a conserved quantity. 


Under a symmetry transformation, various quantities will change. 3 One 
efficient way to describe this is to say that a held ^(x^) will change under 
a symmetry transformation by an amount parameterized by a quantity 
A. For example, for a translation along a spacetime vector we could 
write the transformation 


3 

In this chapter, we are going to as¬ 
sume that the transformations are con¬ 
tinuous. Thus we are ignoring discrete 
symmetries which will be considered in 
Chapter 15. 


</>(x v) —> (j){x^ + Aa M ). 


(10.4) 


The larger A, the larger the degree the transformation is applied. It will 
be useful to consider infinitesimal transformations, and for this purpose 
we adopt the notation 


D<$> 


d(j) 

d\ 


A=0 


(10.5) 


An infinitesimal change in </>, induced by an infinitesimal 5 A, can be 
written as 


6<j> = Dcj) S A. 


( 10 . 6 ) 


Example 10.2 

In the example of our spacetime translation cj)(x fJj ) —► (f)(x M + Aa^), we have 

D(f> = a^d^cf). 


(10.7) 
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This result arises as follows: by writing y 1 ' = x fl ■ Xa 1 ', we have 

d<j> d(j> dyV 
d\ dyi 1 d\ 

and = a 1 '. Taking the limit of A —> 0 gives the required result. 


10.2 Noether’s theorem 

We now turn to Noether’s theorem, which shows that where we have a 
continuous symmetry we also have a conservation law. To identify this 
conservation law we will look for a divergenceless current (i.e. obeying 
d,j, = 0). Such a current is locally conserved (see eqn. 5.35 if in doubt) 
and its time-like component will give rise to a conserved charge. 

Let the field </>{x ) change to c/){x ) + 5cf>(x), where the variation 
5(j){x) vanishes on the boundary of the region in spacetime that we’re 
4 This is a continuous transformation, considering. 4 The change in the Lagrangian density C is then given by 
Noether’s theorem applies to these con¬ 
tinuous transformations but not to the QC dJZ 

discrete transformations examined in = j.\ (10.9) 

Chapter 15. ^ \ fiYJ 


d(d^<t>) 


<5(<9^). 


We will slightly simplify this equation using the substitution 


IP(®) = 


d{dv,4>)' 


( 10 . 10 ) 


This new object (ar) is a generalization of what, in eqn 5.46, we called 
the conjugate momentum tt(x) = SC/Sc/). Our new object IT 1 (a?), the 
momentum density, is the four-vector generalization of 7 r(x). In fact 
tt(x) is the tinre-like component of II M (a;), i.e. 


II 0 (x) = 7r(x). 


( 10 . 11 ) 


This substitution allows us to write eqn 10.9 as 

dC 

SC=—Scj) + W8{d^). 


( 10 . 12 ) 


Using S(d^) = d^(S<j>), and the simple differentiation of a product 


dp{W5<t>) = WdnW) + (d^W)6<f>, 


(10.13) 


we have 


6c=( — - ^ip Sc/ + a„(n »5c/>). 


(10.14) 


The action S should not change under the transformation and so 


SS = J d 4 x Sc/) = 0, (10.15) 
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where the second term in eqn 10.14 vanishes when integrated over a 
large enough volume (using the divergence theorem), and so 

9C 

— = d^W, (10.16) 

which is the Euler-Lagrange equation. Now let’s play this argument the 
other way round and state that <f> obeys the equation of motion. In this 
case eqn 10.14 becomes 


SC = d^WScf)), 

and eqn 10.6 then allows us to write 


(10.17) 


5C = d li {WD(j))5\. (10.18) 

Because we are applying a symmetry transformation, the action S = 
f d 4 xC is unchanged as we have said, and so we don’t expect C to 
change, i.e. SC = 0. However, this condition is too restrictive and so in 
some circumstances we could be persuaded to allow C to change by the 
four-divergence of some function W^(x), so that 

SC = {d„W»)S\. (10.19) 


This is because 


SS = 


d 4 xSC = SX 


d^xd^W^ = 


dAn^W^, 


( 10 . 20 ) 


where A is the surface of the region of spacetime and n M is the outward 
normal (see Fig. 10.1). Since the fields are held constant on the surface in 
order to have a well-defined variational principle, the change in action is 
at most a constant, and so S' = f d 4 cc (SC + d fi W M S A) will be stationary 
under the same conditions that S = J d 4 x SC will be stationary. Putting 
eqns 10.18 and 10.19 together leads to 



Fig. 10.1 A region of spacetime with 
surface A. The outward normal is 


d^U^Dc/)- VP 1 ) = 0, (10.21) 

or more simply = 0 where 


Jfi(x) = n »(x)D<P(x) - W»(x), 


( 10 . 22 ) 


is a locally conserved Noether current. We have deduced a form of 
Noether’s theorem that states the following: 

If a continuous symmetry transformation </> —> <f> + D(j> only changes C 
by the addition of a four-divergence (i.e. DC = d^W 1 ) for arbitrary 
(j>, then this implies the existence of a current Jft = U^Dcf) — W^(x). 

If 6 obeys the equations of motion then the current is conserved, i.e. 

= 0 . 
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. 


N 


?-> 


tO 





time 


Fig. 10.2 The integration is performed 
over a surface in spacetime at constant 
time. The component of the Noether 
current which is perpendicular to this 
surface is the time-like part. 


Using Noether’s theorem to find 
conserved charges 

I: FindD^= f^. 

II: Find IP*(*) = 

Ill: Find W» = DC. 

IV: Write = D<j >lie - W». 

V: Find Q n = fd 3 xj°. 


This assumes that all symmetries of 
the classical theory will turn out to be 
symmetries of the quantum theory too. 
This is true for many important sym¬ 
metries. It turns out not to be the case 
for a small number of cases, which are 
known as anomalies and are discussed 
in more advanced books such as Zee or 
Peskin and Schroeder. 

^ This result is easily seen by recogniz¬ 
ing that conserved charge may be writ¬ 
ten Qn = f d 3 xTl 0 D(j) and recognizing 
that n° is the conjugate momentum to 

4>- 


Conserved currents are important because they give rise to conserved 
charges Qn = j Jl^ dA /( (also called Noether charges). A helpful 
way to apply this idea is to perform the integration over a surface at 
constant time (see Fig. 10.2). In four dimensions, this ‘surface’ is a 
three-volume. Thus 


Q N 



(10.23) 


where is the time-like component normal to the surface. Now since 
dp, = 0, then 


0 = jd 3 x(d^) = J d 3 x(do4+d k J&). (10.24) 


The second term in the integral is / d?xd k J^ = f dA k by the diver¬ 
gence theorem, and this will vanish if the volume is big enough. Thus 
f d 3 xdo = 0 and since do = d/d t, this implies dQN/dt = 0, i.e. a 
conserved charge. 

Everything we have done so far is valid for classical field theories. 
We might wonder if this discussion can be carried over to the quantum 
mechanical fields that are the main subject of this book. Fortunately 
for us Noether’s theorem can be carried over to the quantum realm 5 
with one addition: the so-called ‘normal ordering interpretation’ which 
we introduce in the next chapter. Noether’s theorem then provides us 
with a conserved charge operator Qn, whose eigenvalues are (observable) 
conserved charges. In fact, there is also a quantum mechanical corollary 
to Noether’s theorem which says that if we know the conserved charge 
operator Qn for a quantum mechanical field (j) then we can use it to 
generate the symmetry transformation Dcf) via 6 


Qn, 


= -L D4>. 


(10.25) 


This is illustrated in Fig. 10.3. For now we return to classical field theory 
and continue to investigate the benefits we gain from Noether’s theorem. 


10.3 Spacetime translation 

The field theories that we will examine in this book will be sym¬ 
metric with respect to spacetime translations. That is to say that 
the physical consequences of the theories will be the same whether 
an experiment takes place at a point = (1066AD, Hastings) or 
y M = (1863AD, Gettysburg). Here we examine the consequences of 
this using Noether’s theorem. 

If we subject the coordinates to a variation, so that = x 1 ' + Sx^, 
then we have that D(f> = a^d^cj) and also that 


DC = a^d f ,C = d ll {a^C). 


(10.26) 
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We recognize therefore that DC = where W 11 = a^C and so we 

can write down the conserved current 

= WD<j> - W M 
= WaCd^-aCC 
= aC^d^-S^C] 

= a v T» v , (10.27) 

where = W J/ d l '4> — g^ v C is a quantity that is known as the energy- 
momentum tensor. The conserved charges that arise from this current 
can be written 

P a = I d 3 xT 0a . (10.28) 

Thus the time-like component of this conserved charge is 


symmetry generator 



pO 



d 3 a; [ 7 t ( x ) 4 >( x ) — £(x)] 



Fig. 10.3 Noether’s theorem for the 

(10.29) field ^ 


which we recognize as the energy. The space-like components give us 


pk 



d 3 x / K[x)d k (j)(x), 


(10.30) 


which we recognize as the momentum. (Note that g 00 = 1 and g ok = 0.) 


Example 10.3 

The energy-momentum tensor T^ u is not uniquely defined. If one adds a term 
d\X x ^ u to , where = —X^ Xu , then show that the new tensor is still 

divergenceless. 

Solution: d^(T^ +d\X x i 1 - v ) = + d l _ l d\X x ' J - v = 0 + 0 = 0. This works because 

(i) = 0 because T is a conserved current and (ii) d l _ l d\X Xll ’ v = 0 follows 

from the antisymmetry of X Xfll/ w ith respect to swapping the first two indices and 
the fact that you are summing over botli indices // and A. 


The energy-momentum tensor , as we have constructed it, is not 
symmetric, and that turns out to cause problems when it is used in 
general relativity, 7 but we can use the trick in the previous example 
to redefine T^ —> T^ + d\X Xtlv and force it to be symmetric. This 
symmetric nature of is also important when we come to look at 
Lorentz transformations. 


Example 10.4 

We will extract some information from Tc 1 '. For an infinitesimal Lorentz 
transformation 8 we can write 


7 r> 

Einstein’s field equations are 

R R-- Sn °T 

where R^ v is the Ricci tensor, express¬ 
ing the curvature of spacetime, and R 
is the Ricci scalar. For further details, 
see any good book on general relativity, 
e.g. Penrose (2004). 

^See Exercise 9.3. 


Sx^ = cj^ u x u 8 A, 


(10.31) 


where u is an antisymmetric tensor. Hence the change in field cf) is given by 

8(f) = 8x^d^4> = tj^ u x u d^(f)8X, (10.32) 
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^The second equality in eqn 10.34 fol¬ 
lows from the following argument. Dif¬ 
ferentiating the product x° £ gives 

d p (x a c) = x a d p c + ca p x a , 

and because d p x a = 5 p(T , the antisym¬ 
metry of ujpa implies that 

uj pa d p x a = u pa 5 pa = 0 . 

Hence 

(jjpoX a d p C = l o p( jd p {x <T C) 

= d p (uj p(7 x a £). 

10 We have turned (J p ) pCT into (J p ) pCT 
using (J p ) pa = (j p )P a - (J p yP. 

11 These are Noether conserved charges, 
but we drop subscript N’s here to avoid 
clutter. 


and so 

D<f) = uj^Xudpfj) = u)p V x v d' x <$> (10.33) 

so that the effect on the Lagrangian density £ is 9 

D£ = co p(T x (7 d p £ = d p up (7 x (7 £ = dp[g pp ujpaX <T £\ = dpW p , (10.34) 


where = gPPcjp^x^£. Thus we can go straight to our conserved current 
J p = n p D(f)-W p 

= UJpcr [UP-x^dP^ - g pp x G £\ 

= LO P (yX G T pp 

= ujp a (Jp) pa , (10.35) 

where ( JP) pCT = x a T pp . There are six parameters in u) pCT (remember that it is 
antisymmetric) and so there are six conserved currents. We will write 


( J P Y G = x p T p(T - x a TP p , (10.36) 

where the currents have been antisymmetrized 10 compared with eqn 10.35. These 
are conserved currents and so dp(J p ) pcr = 0, although this only works because T pu 
is symmetric. The conserved charges 11 are 

Q X P = J d 3 x(J X ) p0 , (10.37) 

from which we can extract the angular momentum components 

0» = /AKT>»-,.T-) (10.38) 

and the boosts 

Q 0i = I d 3 x (x°T i0 — x i T 00 ). (10.39) 

Note that since dQ° l /dt = 0 and x° = t, then 

0 = J d 3 xT i0 +t J d 3 xt i0 -^J d 3 xx i T 00 = pt+tP 1 -^ J d 3 xx i T 00 , (10.40) 

and hence ^ f d 3 xx l T 00 is a constant, showing that the centre of mass of the system 
moves in uniform motion. 


10.4 Other symmetries 

We have seen how translations, rotations and boosts can transform 
spacetime coordinates and also Lagrangians. However, the only effect 
on the fields has been due to the fact that the fields are functions of the 
spacetime coordinates. A field, you will remember, is a machine that 
when fed a coordinate x^ returns a number <p(x^). What we have been 
considering is a scalar field, but of course when the field is more compli¬ 
cated (for example, a vector or a spinor) then we have to worry about 
how the symmetry transformation affects the field itself, in addition to 
the effect on the spacetime coordinates of which the field is a function. 
That will lead to additional layers of complexity which we will save for 
later. One of the most exciting consequences of this is that if we have 
a complex scalar field we will be able to show that the U{ 1) symmetry 
we noticed in Section 7.6 leads to the conservation of particle number. 
We will save that gem for Chapter 12, but before that, in the following 
chapter, we will outline a procedure for quantizing our field theories. 
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Chapter summary 

• Noether’s theorem states that if a continuous symmetry transfor¬ 

mation <f> —> <f> + Dcj) only changes C by the addition of a four- 
divergence (i.e. DC = d^W^) for arbitrary </>, then this implies the 
existence of a current J ^ = U^Dcp — W ,i {x) and if </> obeys the 
equations of motion then the current is conserved, i.e. = 0. 

• In short, Noether’s theorem states that continuous symmetries lead 
to conserved currents. 


Exercises 


(10.1) Show that [4>(x),P a \ = i d a <p{x), where P a is 
the conserved charges from spacetime translation 
(eqn 10.28). 

(10.2) Consider a system characterized by N fields 

<f>i The Lagrangian density is then 
C(4> i,..., 4>n ; i, • • •, d^cpi v; x M ). Show that the 

Noether current is 


show that 

IT P = 


dC 


= -F cjp 


(10.44) 


d{8 a A p ) 

Hence show that the energy-momentum tensor 

TC = n ^d„A a - 5 P C, (10.45) 


J M = - W*(x), (10.41) 

a 

where DC = 

(10.3) For the Lagrangian 

C = \ ( d P cj>) 2 ~ \m 2 <t> 2 , (10.42) 

evaluate T pl ' and show that T 00 agrees with what 
you would expect from the Hamiltonian for this La¬ 
grangian. Show that = 0. Derive expres¬ 

sions for P° = fd 3 xT 00 and P { = fd 3 xT 0i . 

(10.4) For the Lagrangian 

C = -^F pu F pu = ^(E 2 -B 2 ), (10.43) 


can be written as 

m -F^d v A a + F al3 F aP . (10.46) 

This tensor is not symmetric but we can symmetrize 
it by adding d x X Xttv where X x = F px A v . Show 
that X Xpl ' = —Show further that the sym¬ 
metrized energy-momentum tensor T M1 ' = T # “' + 
d x X x ^ can be written 

T pv = F^F" + F a/3 F a0 . (10.47) 

Hence show that T°° = \{E 2 + B 2 ), the energy 
density in the electromagnetic field, and T l ° = 
(E x B) z , which is the Poynting vector and de¬ 
scribes the energy flow. 
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1 Field operators were introduced in 
Chapter 4. 


You Ve got to have a system 
Harry Hill (1964- ) 

Quantum field theory allows us to consider a Universe in which there ex¬ 
ist different, yet indistinguishable, copies of elementary particles. These 
particles can be created and destroyed by interactions amongst them¬ 
selves or with external entities. Quantum field theory allows us to de¬ 
scribe such phenomena because particles themselves are simply excita¬ 
tions of quantum fields. 

To see how particles emerge from fields we need to develop a way 
to quantize the classical fields we have looked at previously. The good 
news is that there is a machine available that takes a classical field the¬ 
ory and, after we turn the handle, it spits out a quantum field theory: 
that is, a theory where quantities are described in terms of the number 
of quantum particles in the system. The name of the machine is canon¬ 
ical quantization. In developing canonical quantization we’ll see that 
particles are added or removed from a system using field operators 1 
and these are formed from the creation and annihilation operators we 
found so useful in previous chapters. 


11.1 The canonical quantization machine 

Canonical quantization is the turn-the-liandle method of obtaining a 
quantum field theory from a classical field theory. The method runs like 
this: 

• Step I : Write down a classical Lagrangian density in terms of 
fields. This is the creative part because there are lots of possible 
Lagrangians. After this step, everything else is automatic. 

• Step II: Calculate the momentum density and work out the 
Hamiltonian density in terms of fields. 

• Step III: Now treat the fields and momentum density as opera¬ 
tors. Impose commutation relations on them to make them quan¬ 
tum mechanical. 

• Step IV: Expand the fields in terms of creation/annihilation op¬ 
erators. This will allow us to use occupation numbers and stay 


sane. 
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• Step V: That’s it. Congratulations, you are now the proud owner 
of a working quantum field theory, provided you remember the 
normal ordering interpretation. 


We’ll illustrate the method with one of the simplest held theories: the 
theory of the massive scalar held. 

Step I: We write down a Lagrangian density for our theory. For 
massive scalar held theory this was given in eqn 7.7, which we rewrite 
as 

C=-[d li <f>(x)] 2 --m 2 [<l>(x)] 2 . ( 11 . 1 ) 

[Recall that the equation of motion for this theory is the Klein-Gordon 
equation ( d 2 + m 2 )(j) = 0 leading to a dispersion E 2 = p 2 + m 2 .} 

Step II: Find the momentum density (eqn 10.10) W'(x) given by 


IF(a;) 


dC 

9(d^<j>{x ))' 


( 11 . 2 ) 


For our Lagrangian in eqn 11.1 this gives 11 ^( 2 ;) = d^^x). The time-like 
component 2 of this tensor is II 0 (x) = tt(x) = d°<j)(x). This allows us to 
dehne the Hamiltonian density in terms of the momentum density 


H = U°{x)d 0 (j){x)-C, (11.3) 

and using our Lagrangian 3 leads to a Hamiltonian density 
H = d° 0(x)do<?>(x) — C 

= \ [doHx)} 2 + \ [V<Ha ;)] 2 + ^m 2 ^{x)] 2 . (11.5) 

The last line in eqn 11.5 tells us that the energy has contributions from 

(i) a kinetic energy term reflecting changes in the configuration in time, 

(ii) a ‘shear term’ giving an energy cost for spatial changes in the held 
and (iii) a ‘mass’ term reflecting the potential energy cost of there being 
a held in space at all. Taken together this has a reassuring look of 
E = (kinetic energy) + (potential energy), which is what we expect from 
a Hamiltonian. 

Step III: We turn helds into held operators. That is to say, we make 
them operator-valued fields: one may insert a point in spacetime into 
such an object and obtain an operator. We therefore take (j>(x) —> cj)(x) 
and n H (a:) —> Ft 0 (a:). To make these held operators quantum mechanical 
we need to impose commutation relations between them. In single¬ 
particle quantum mechanics we have [x,p] = i h. By analogy, we quantize 
the held theory by defining the equal-time commutator for the held 
operators 

[j)(t,x),fl°(t,y)\ = iS (3) (x - y). (11.6) 

As the name suggests, this applies at equal times only and otherwise 
the fields commute. We also have that [</>(x) , = [II 0 (a;), fl°( 2 /)] = 0 

(and likewise for the daggered versions). Expressed in terms of these 
fields, the Hamiltonian density l~i is now an operator Id which acts on 


2 The metric (H--), which allows 

us to say A 0 = Ao, lets us swap up 
and down indices for the zeroth com¬ 
ponent of any object without incurring 
the penalty of a minus sign. 

Q 

which may be helpfully rewritten as 

(11.4) 
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state vectors. This is all well and good, except that we don’t know how 
operators like 4>{x) act on occupation number states like \n1n2n3 ...). 
What we do know is how creation and annihilation operators act on 
these vectors. If only we could build fields out of these operators! 

Step IV: How do we get any further? The machinery of creation and 
annihilation operators is so attractive that we’d like to define everything 
in terms of them. In particular, we found a neat analogy between par¬ 
ticles in momentum eigenstates and quanta in oscillators. What we do, 
therefore, is to expand the field operators in terms of the creation and 
annihilation operators. We’ve already seen this in the coupled oscilla¬ 
tor problem (eqn 2.68) where we obtained a time-independent position 
operator of the form 


E 


(2 u k N) 


r(4e ijfea 


a\e- iika ) 


(11.7) 


We are rewriting our momenta k —> p 
and our energies —► E p , so the 

factor(2cjfe) 1 / 2 —> (2 Ep) 1 / 2 . The nor¬ 
malization will be discussed below. 


By analogy we write down a time-independent field operator for the 
continuous case which will look like 



d 3 p 1 

(2tt)I (2 E p )h 


( a v e 


ipx + ate-iP' 3 ’) 


( 11 . 8 ) 


with E p = +(p 2 +m 2 )3 (that is, we consider positive roots only, a feature 
discussed below) and where, as before, our creation and annihilation 
operators have a commutation relation [d p , a^} = S f:i> (p — q). 

The field operators that we’ve written down are intended to work in 
the Heisenberg picture. To obtain their time dependence, we hit them 
with time-evolution operators. We use the Heisenberg prescription for 
making an operator time dependent: 


<j>{x) = (j>(t, x) = &(t, 0)(j>(x)U(t, 0) = e lEt (j>{x)e 1 ^ t . (11.9) 


The only part that the U(t, 0) = e lHt operators affect are the creation 
and annihilation operators, and we have 


(t, 0)a p U(t, 0) = e~ iEpt a p , (11.10) 

which shows that the ci p picks up a e~ lEpt . Similarly, the a) p part will 
pick up a e lEpt . 


Example 11.1 

To see this unfolding we’ll consider a simple example of one component of the field 
acting on an example state. The component we’ll consider is a q e iq x and the example 
state is \npn q n r ). Taking it one step at a time: 

e i6t a q e iqx e- iAt \n p n q n r ) 

= e i6t d q \n p n q nr)e iq *e- i< - npEp+n "> E <‘ +npEp)t 
= - 1 )n r )e iq a! e- i(n P Ep+rl i E ‘> +npEp)t 

= y/n^\n p (n q - l)n r ) e d n P E P + ( n <l~ 1 ) E ‘l+ n r E -r)t ( f<I * e -i("p£p+»,B,+»r£r)i 
= x/nfi\n p (n q - l)n r )e~ lEc i t e lq ' x . (11.11) 
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Part of what we’re left with, namely *Jn q \n p (n q — 1) n r ). is exactly the same as if 
we’d just acted on the original state with a q . The effect of dynamicizing this operator 
has just been to multiply by a factor e — lE i t J so we conclude that the operator we 
seek is a q e~^ E i t ~ qx ) = a q e~ lq x . 


In summary, what 
given by 4 

we call the mode expansion of the scalar field is 

4>{x) = J 

^ P 1 (n r~'P-x 1 atpip-a’j 

(11.12) 

(271)1(2 E p )h [apG +ap& h 

with E p = +(p 2 + m 2 )3. 


Note that by expanding out the position field we’ll get the momentum 
expansion for free, since for our scalar field example II Al (a:) = d^(f>(x). 
Also note that because the field in our classical Lagrangian is a real 
quantity, the field operator <f>(x) should be Hermitian. By inspection 
= <f(x), so this is indeed the case. 


11.2 Normalizing factors 

Before proceeding, we will justify the normalization factors in eqn 11.12. 
In evaluating integrals over momentum states we have the problem that 
cl 3 p is not a Lorentz-invariant quantity. We can use d 4 p where p = (p°,p) 
is the four-momentum, but for a particle of mass m then only values of 
the four-momentum which satisfy 5 p 2 = mr need to be considered. This 
is known as the mass shell condition (see Fig. 11.1). Consequently 
we can write our integration measure 

d 4 pS(p 2 — m 2 ) 0(p°). (11.13) 

We have included a Heaviside step function 6 (p°) to select only positive 
mass particles. 


Example 11.2 

Show that S(p 2 — m 2 ) 0(po) = ^— 
We use the identity 

S[f(x)} = 


5(p 0 - E p )0(p o ). 


E 

C 1/0=0} 


1 

I/'Or) | 


<5(a:), 


(11.14) 


where the notation tells us that the sum is evaluated for those values of x that 
make f(x) = 0. We take x = p° and f(p°) = p 2 — m 2 = ( p °) 2 — p 2 — m 2 . This 
gives us that |/ / (p°)| = 2|p°| and we use the fact that the zeros of f(p°) occur for 
p° = =b(p 2 + m 2 ) 2 = d zE p to write 


8(p 2 - to 2 ) 0(po) = [<5(po - E p )6(p 0 ) + 5(p 0 + E p )6(p 0 )], (11.15) 

ZH/p 


and so the result follows (since the second term in eqn 11.15 is zero). 


4 This looks a little different from what 
we had in Chapter 4 when we intro¬ 
duced field operators. We will explain 
the reason for the difference in Sec¬ 
tion 11.5. 


This section can be skipped if you are 
happy to take eqn 11.12 on trust. The 
purpose here is simply to justify the fac¬ 
tors (27 t) 3 / 2 (2E p ) 1 / 2 which otherwise 
seem to appear by magic. 

°The condition p 2 = m 2 means that 
(P 0 ) 2 — p 2 = E 2 — p 2 = m 2 . 


E=p° 

A 



Fig. 11.1 The mass shell p 2 = m 2 is a 
hyperboloid in four-momentum space. 
Also shown is the equivalent surface for 
light, p 2 = 0. 
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Thus we will write our Lorentz-invariant measure as 

d 3 p 

(2tt) 3 2 E p 


(11.16) 


where the additional factor 5(po — E p )9(p 0 ) is there in every calculation 
and so we suppress it, and we have included the factor l/(27r) 3 because 
the mode expansion is essentially an inverse Fourier transform (we have 
one factor of 1/( 2 - 7 t) for every component of three-momentum). We are 
now in a position to write down integrals, for example: 


1 = 


d 3 p 

{2tt) 3 2E p 


\P)(P I- 


(11.17) 


This requires us to have Lorentz-covariant four-momentum states | p). 
We previously normalized momentum states according to 


(p\q) = 6 {3 \p- q), (11.18) 


so our new four-momentum states | p) will need to be related to the 
three-momentum states |p) by 

|p) = {2nf/\2E p ) 1 /' 2 \p), (11.19) 

and then their normalization can be written 

(j>\q) = (2n) 3 2E p 6W(p-q). (11.20) 

Similarly, to make creation operators or appropriately normalized so 
that they create Lorentz-covariant states, we must define them by 

4 = {2irf/\2E p )^al, ( 11 . 21 ) 

so that dj,|0) = |p). In this case our mode expansion would take the 
form of a simple inverse Fourier transform using our Lorentz-invariant 
measure 

+ ( 11 . 22 ) 

or writing in terms of a' p and a p rather than a' p and a p : 

k *) = [ 7 ~T 3 , ot ! ,X (a P e~ ip x + a p e' p x ) , (11.23) 

J ( 2 tt )2 ( 2 E p )2 

which is identical to eqn 11 . 12 . 


11.3 What becomes of the Hamiltonian? 

We can now substitute our expansion of the field operator <j>(x) into 
the Hamiltonian to complete our programme of canonical quantization. 
This will provide us with an expression for the energy operator in terms 
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of the creation and annihilation operators. The Hamiltonian is given by 
the volume integral of the Hamiltonian density, 


H = [ d 3 x l 


do (j>{x) 


[V<f>(x)] 2 + m 2 [(j)(x)f 


(11.24) 


All we need do is substitute in the mode expansion and use the com¬ 
mutation relations to simplify. We’ll start by computing the momentum 
density n M (:r) = d p (l>{x) which is given by 


n„(a;) = d p (j){x) = 


d 3 p 


tt(- ip») (a p e~ ip x — a^ p e ip x ) . 


( 2 n)H 2 E p )2 

(11.25) 

To obtain an expression for do4>(x), simply take the time-like component 
of H p (x): 


d 0 <t>(x) = 


d 3 p 


i ( i E p ) (ci p e 


~ ipx - aj,e ip ' x ) , 


(2tt)H 2E p )2 

while the space-like components provide us with V0(a;): 
d 3 p 


(11.26) 


V</>(x) = 


~x(ip) (a p e~ ip ' x — a) p e' p ' x ) . (11.27) 


(2tt)H 2E p )2 

We now have all of the ingredients to calculate the Hamiltonian. 


Example 11.3 

Substitution of the mode expansion leads to a multiple integral of the form 
d 3 x d 3 pd 3 g 


H = - 


II 


- [(-E p E q - p ■ q)[a p e~ ip ‘ x - oje 1 *"*] 


(2n) 3 (2E p ) 2 (2E q ) 2 
X [d q e~ iqx - ate 19 ' 31 ] + m 2 [a p e~ ip - x + ale ipx ][a q e- iqx + ^e 1 ' 3 "*]] . (11.28) 

We use our favourite trick again: first, we integrate over x and use j d 3 xe lpx = 
(2ir) 3 S( 3 l (p) and obtain 


H = - 


II 


d 3 pd 3 </ 


(11.29) 

(2E p )i (2E q )i 

x [<d 3 )(p- q)(E p E q +p • qf + m 2 )[at a g e -i( s p -B q )i + a p d ] q e~ i(E v~ E <i'> t } 

+ S (3) (p + q)(-E p E q - p - q + m 2 )\a' p a] l e~^ Ep+E, ‘' >t + d p d q e~^ Ep+Eq,t ^ . 
Then we do the integral over q to mop up the delta functions: 

H = \ J d3 P-^r [( e p +P 2 +™ 2 )(4“i> + «pflp) 

+ (~E p + p 2 + m' 2 )(a^ p a, t _ p e 2lE P t + d p a- p e~ 2lE f t )j , (11.30) 

and since = p 2 + m 2 this quickly simplifies to 

i J d 3 pE p (d p d ] p + d\,d p ). (11.31) 


H = - 
2 . 
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Using the commutation relations on the result in eqn 11.31 we obtain 

E = J d 3 pE p (ala p +^ 3 \0)^ . (11.32) 

The last term should fill us with dread. The term | f d 3 p<5 3 (0) will give 
us an infinite contribution to the energy! However we should keep in 
mind that by evaluating the total energy E we’re asking the theory to 
tell us about something we can’t measure. This is a nonsensical question 
and we have paid the price by getting a nonsensical answer. What we 
can measure is differences in energy between two configurations. This 
is fine in our formalism, since upon taking the differences between two 
energies the |cn 3 l(0) factors will cancel obligingly. However, it’s still 
rather unsatisfactory to have infinite terms hanging around in all of our 
equations. We need to tame this infinity. 


11.4 Normal ordering 


To get around the infinity encountered at the end of the last section we 
define the act of normal ordering. Given a set of free fields, we define 
the normal ordered product as 


N 


abc ] 


Operators rearranged with all 
creation operators on the left 


(11.33) 


6 The origin of the infinity in eqn 11.32 
is due to an ordering ambiguity in the 
classical Lagrangian. Consider a gen¬ 
eral classical theory formed from com¬ 
plex values fields ^(x). The Lagrangian 
describing these fields will contain bilin¬ 
ear terms like which could equally 
well be written in a classical theory as 
-ipip'. When we quantize and insert the 
mode expansion the order that was cho¬ 
sen suddenly becomes crucial. Normal 
ordering removes the ambiguity to give 
a meaningful quantum theory. 


The N is not an operator in the sense that is doesn’t act on a state to 
provide some new information. Instead, normal ordering is an interpre¬ 
tation that we use to eliminate the meaningless infinities that occur in 
Held theories. So the rule goes that ‘if you want to tell someone about 
a string of operators in quantum field theory, you have to normal order 
them first’. If you don’t, you’re talking nonsense. 6 

Rearranging operators is fine for Bose fields, but swapping the order 
of Fermi fields results in the expression picking up a minus sign. We 
therefore need to multiply by a factor (—l) p , where P is the number of 
permutations needed to normally order a product of operators. 


Example 11.4 

Some examples of normal ordering a string of Bose operators follow. We first consider 
a single mode operator 

N j^acdj = a) a, N [(dal = a)a, (11.34) 


and 

N ^cddadtcdj 

Next, consider the case of many modes 

N ^a p a^d r j 


= a' a' a' aa. 


— afjCLpCLj*. 


(11.35) 

(11.36) 


The order of a v and d r doesn’t matter since they commute. 

For Fermi fields, N[dpCqCr] = —CqC p c r , the sign change occurring because we have 
performed a single permutation (and hence an odd number of them). 
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Step V: We can finally complete the programme of canonical quantiza¬ 
tion. With the normal ordering interpretation the old expression 

H = - J d 3 p E p (a p a p + a p d p ), (11.37) 

becomes 

N[H] = 1 J d 3 pE p N[a p d\, + a\,a p ] 

= — J d pE p 2d p a p 

= J cl 3 pE p n p , (11.38) 

where n p = ad p ci p is the number operator. Acting on a state it tells you 
how many excitations there are in that state with momentum p. 

We now have a Hamiltonian operator that makes sense. It turns out 
that the Hamiltonian for the scalar field theory is exactly that which we 
obtained for independent particles in Chapter 3. This isn’t so surprising 
since we started with a Lagrangian describing waves that didn’t interact. 
What we’ve seen though, is that the excited states of the wave equation 
can be thought of as particles possessing quantized momenta. These 
particles could be called scalar phions. 7 They are Bose particles with 
spin 8 5 = 0. 


Example 11.5 

We have claimed that the vacuum energy is unobservable, so may be ignored. How¬ 
ever changes in vacuum energy terms are physically significant and lead to measurable 
effects. Such a change results if you adjust the boundary conditions for your field 
and this is the basis of the Casimir effect. 9 This is a small, attractive force be¬ 
tween two closely spaced metal plates, which results from the vacuum energy of the 
electromagnetic field. The essential physics can be understood using a toy model 10 
involving a massless scalar field in one dimension. 

Consider two metal plates I and II separated by a distance L. We put a third 
plate (III) in between them, a distance x from plate I (see Fig. 11.2). We will derive 
the force on plate III resulting from the field on either side of it. The presence of the 
plates forces the field to be quantized according to k n = nn/x or nir/(L — x). The 
dispersion is E n = k n and so the total zero-point energy is given by 




1 

- 1 

/ 717T ^ 

| + 1 | 

( niv \ 

2 

V X J 

2 ' 

KL-x)_ 


= /(+) + f(L - x), 


(11.39) 


that is, ftoj n per mode. 11 These sums both diverge, just as we expect since we are 
evaluating the infinite vacuum energy. 

However, real plates can’t reflect radiation of arbitrarily high frequency: the high¬ 
est energy modes leak out. To take account of this we cut off these high-energy modes 
thus: 


(11.40) 


7171 717T 

2x 2x 

which removes those modes with wavelength significantly smaller than the cut-off 
a. The value of this cut-off is arbitrary, so we hope that it won’t feature in any 
measurable quantity. 


*7 

This is because they are excitations of 
the scalar field <j> (Greek letter ‘phi’). 

^ We shall see that spin information 
is encoded by multiplying the creation 
and annihilation operators by objects 
that tell us about the particle’s spin 
polarization. These objects are vectors 
for 5 = 1 particles and spinors for a 
5 = 1/2 particles. For 5 = 0 they do 
not feature. 

^The effect was predicted by the Dutch 
physicist Hendrik Casimir (1909-2000). 


1 This approach follows Zee. A more 
detailed treatment may be found in 
Itzykson and Zuber, Chapter 3. 



Fig. 11.2 Two metal plates (I and II) 
separated by a distance L, with a third 
plate (HI) inserted in between. 

Remember, for photons c o n = ck n , 
and the zero-point energy is ^fku ni so in 
units in which h = c = 1 this becomes 
\k-n-, and so E = Equa¬ 

tion 11.39 contains two such sums. 
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The sums may now be evaluated: 


/(*) = 


E n7r „ — mra/x 
2x 


1 a 

2 da 

1 d 


£« 


-nit a/x 


1 


2 da 1 - e* a / x 

7T Qita/x x 7T 

2a: (1 — e 7ra / :E ) 2 27ra 2 24a: 

The total energy between plates I and II is 

L 7T / 1 1 

24 U 


£ = /(*) + /(£ - *) = 


27ra 2 24 \ x L — x 

If x is small compared to L, then we find a force 

F = _^ = _ZL_ 

Ox 24x 2 ’ 


+ 0(a 2 ). 

(11.41) 

+ 0(a 2 ). 

(11.42) 


(11.43) 


which is independent of a, as we hoped. Thus, there is an attractive force between 
the closely spaced plates I and III. This is the Casimir force. We can understand 
this force intuitively by realizing that as the two plates are pulled together we lose 
the high-energy modes. This reduces the energy between the plates and leads to 
an attractive force. A more quantum-field-theory friendly interpretation is that the 
effect results from quantum fluctuations in the vacuum, in which particles are spon¬ 
taneously created and annihilated. These processes give rise to the vacuum Feynman 
diagrams described in Chapter 19. 


11.5 The meaning of the mode expansion 


In the next chapter we’ll turn the crank on our canonical quantization 
machine for the second simplest held theory that we can imagine: that 
of the complex scalar held. Before doing that, let’s have a closer look at 
the mode expansion for this theory. Our hrst guess for the held operator 
might have been the simple Fourier transform 


4> (z) 


d 3 p 1 

(2tt)I (2 E p )h 


a p e 


—i p-x 


(11.44) 


12 Why not? The result would be a field 
for which [(f>(x), 7 = 0 for space-like 

\x — y\. See Weinberg, Chapter 5 for 
the details. 


which looks like the one we had in Chapter 4, where the e~' px tells 
us that the particles are incoming. Unfortunately, this expansion won’t 
work for the relativistic theory we’re considering. The problem is the 
existence of negative energy solutions in the relativistic equations of 
motion: each momentum state gave rise to two energies E p = ±(p 2 +m 2 ) 
and we can’t just leave out half of the solutions. 12 Looking back at our 
discussion of the Klein-Gordon equation we saw the resolution of this 
problem, and we’ll employ the same solution here. 

What we need is a mode expansion that includes these negative energy 
modes 


<j>(x) = 


positive E p 
mode 


negative E p 
mode 


(11.45) 
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The expansion is carried out in terms of incoming plane waves e~ lp ' x . 
Now recall Feynman’s interpretation of the negative frequency modes in 
which negative energies are assumed to be meaningless, all energies are 
set to be positive and the signs of three-momenta for such modes are 
flipped. In this picture, the formerly negative energy states are inter¬ 
preted as outgoing antiparticles. The positive energy modes continue to 
represent incoming particles. Incoming here means, not only a factor 
e~ lp x , but also that the particle is annihilated: it comes into the system 
and is absorbed by it. Conversely, outgoing means a factor e lp x and 
that the particle is created. We therefore interpret the mode expansion 
as 



incoming positive E p 
particle annihilated 



outgoing positive E p 
antiparticle created 

(11.46) 


The resulting expansion of a held annihilation operator is then 


kx) =J£ <lL47) 

where the particle and antiparticle energies are given by E p = + (p 2 + 
m 2 ) 2 . The rules are that a p annihilates particles, a) p creates particles 
and the operators b p and E p respectively destroy and create antiparticles. 
(For our scalar field theory the particles and antiparticles are the same: 
we say that ‘each particle is its own antiparticle’. This means that a p 
can be thought of as either an operator that annihilates a particle or one 
that annihilates an antiparticle.) 13 


13 As we will see, in other theories we 
will need separate operators to perform 
these two distinct roles, but in scalar 
field theory there is no difference. Thus 
in this case a p = b p , because in a scalar 
field theory a particle is its own antipar¬ 
ticle: particles and antiparticles are the 
same. This will not necessarily be the 
case for other field theories, an example 
of which will be described in the next 
chapter. 


Example 11.6 

Let’s see what happens if we hit the vacuum state with our field annihilation operator. 
Since dp|0) = | p) we have 

H x )\0) = [ -^-re ip '*|p>- (11-48) 

J (2n)2(2E p )2 

We have therefore created an outgoing superposition of momentum states. Pick out 
one of these states and make an amplitude by folding in a relativistically normalized 
state (<?| = ( 2 tt) 2 ( 2 E p )2 (q| : 

(2rr) §(2£p) 5 (<#(*) 10) = J d 3 p e ip ' x (q\p) = J d 3 pe ip x 8^\q - p) = e iq x . 

(11.49) 

In the language of second quantization, e iq ' x tells us how much amplitude there is in 
the qth momentum mode if we create a scalar particle at spacetime point x. Note 
that e ipx = e i( E pt-p *) an( j so 

J d 3 pe ip x 8^\q -p) = = e iq x , (11.50) 

demonstrating once again that our integral over three-momentum coordinates nev¬ 
ertheless results in a single particle in a four-momentum state. 
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^or, for the complex valued fields, as 
bilinear in the field and its adjoint. 


Canonical quantization for a 
scalar field: 

I Write down a Lagrangian den¬ 
sity jC. 

II Evaluate the momentum den¬ 
sity U^(x) = dC/d(d P 4>) and 
the Hamiltonian density 

H = n°<9 O 0 - c. 


Ill Turn fields into operators and 
enforce the commutation rela¬ 
tion [<j>(x), fl 0 ( 2 /)] = i5( 3 )(aj— y) 
at equal times. 


IV Express the field in terms of a 
mode expansion of the form 


4>(x) = f 


d 3 p 


1 


(2tt)2 ( 2 E p )2 

x (a p e- ip a: + at e ip x 
where E p = +(p 2 + m 2 ) 2 . 


V Evaluate H and normal order 
the result, leading to an expres¬ 
sion like: 


f 


N[H] = / d s pE p a],a p . 


It is important to note that canonical quantization will not succeed 
in diagonalizing all field theories. Roughly speaking it works only for 
those Lagrangians which can be written as quadratic in a field and its 
derivatives. 14 (We will revisit the mathematics of this point in Chap¬ 
ter 23.) The result of canonical quantization is a system described by 
single particles in momentum states which don’t interact with each other. 
For this reason Lagrangians that may be canonically quantized are called 
non-interacting theories. In contrast, those theories which cannot be 
diagonalized with canonical quantization are called interacting theo¬ 
ries; these will be described in terms of single particles in momentum 
states which do interact with each other. Interacting theories are the 
subject of much of the rest of this book. For now we will continue 
to look at non-interacting theories and in the next chapter we examine 
some more uses of the canonical quantization routine that we have built. 


Chapter summary 

• Canonical quantization is an automated method for turning a clas¬ 
sical field theory into a quantum field theory. 

• Field operators are formed from mode expansions with amplitudes 
made of creation and annihilation operators. 

• Normal ordering is needed to make sense of quantum field theories. 

• The results of each step of the canonical quantization procedure 
for the scalar field theory are shown in the box in the margin. 


Exercises 


(11.1) One of the criteria we had for a successful theory 
of a scalar field was that the commutator for space¬ 
like separations would be zero. Let’s see if our scalar 
field has this feature. Show that 

(11.51) 

For space-like separation we are able to swap (y — x) 
in the second term to (x — y). This gives us zero, 
as required. 


(11.2) Show that, at equal times x° = y °, 

[«foO,n°(y)] = i J ^(e ip (*- a >+e- ip -(*- a >). 

(11.52) 

In this expression there’s nothing stopping us swap¬ 
ping the sign of p in the second term, and show that 
this leads to 

[<t>(x),fl°(y)] = iS (3) (x - y ). 


(11.53) 





Examples of canonical 
quantization 


12 


Examples are like cans of over-strength lager. One is both 
too many and never enough. 

Anonymous 

In this chapter we will apply the methods from the previous chapter to 
the complex scalar field. This theory illustrates several ideas that are 
very important in quantum field theory. We will also find the Noether 
current and finally, we will take the non-relativistic limit of the theory. 
This last point is particularly important for thinking about applications 
to condensed matter, where much of the physics is firmly in the non- 
relativistic domain. 
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12.1 Complex scalar field theory 


The Lagrangian for the complex scalar field 1 is an example of a the¬ 
ory with more than one component. Here we have two: ip(x) and ip^(x). 
We’ll now canonically quantize this Lagrangian using the procedure pre¬ 
sented in Chapter 11. 

Step I: We start by writing down the Lagrangian we want to quantize. 
The Lagrangian for complex scalar field theory is 

C = d tJ ’ip\x)d IJ ,ip(x) — m 2 ip^ (x)ip(x). (12.1) 

Step II: Identify the momentum density and write down the Hamilto¬ 
nian. Each component 2 o of the field (i.e. <j = ip and ip 1) has a different 
momentum density. The momentum of the -0-field and of the 0l-field are 
given, as usual by the time-like (zeroth) component of the momentum 
density nj): 


1 Recall where it came from. We con¬ 
sidered the addition of two scalar fields 

c = \ [d^Mz)] 2 - \ m2 [0i0)l 2 
+ ^ [ d M </> 2 ( z )] 2 - [^ 20)] 2 • 

We noticed that we can make a 
transformation that simplified this La¬ 
grangian. We defined 

= 75 [010) + 02 (a:)], 

^ = 75 [010) - i02 0)]. 

and we obtained the Lagrangian in 
eqn 12.1. This Lagrangian looks rather 
like that of the massive scalar field, but 
without the factors of 1/2. 


o 

QH Note carefully that <j isn’t a tensor 

= mo = 9°1p\ n° t = , = d °1p. (12.2) component, it simply labels the compo- 

d{OQ lp) d(Oo1p ) nent of the field: ip or ipt. This implies 

that n ^ = n£. 

Now that we have the momenta we can write the Hamiltonian. For the 
case of more than one field component we can write down 


= S ^ J V°Ax)d 0 ip°(x) -C 

G 

= d 0 ip\x)d 0 ip(x) + r) • Vip{x) + m 2 ip\x)ip(x). (12.3) 


U 
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Here we abbreviate 

d 3 p 1 


L = f- 


(2vr)2 (2Ep) 2 


Step III: Make the fields into quantum mechanical operators. We 
promote the fields to operators and impose equal-time commutation re¬ 
lations thus 






= i<5 (3) (a; - y), 


(12.4) 


and all other commutators vanish. 

Step IV: Expand the fields in terms of modes. For the case of the 
real scalar field, we found that the field operator 3 <j>{x) = f p (a p e~ lpx + 
cd p e lp ' x ), so that it is Hernritian [i.e. self-adjoint: <f\x) = <f{x)\. For our 
complex scalar field there is no reason why the field operator should be 
Hernritian. The appropriate mode expansion of the complex scalar field 
operators take the more general form 


lf{x) = j 

f d 3 p 1 

[ia p e~ ip ' x + ^e ip ' x ) 

(2tt)§ (2 E p )i 

ft(x) = j 

f d 3 p 1 , 

+ b p e ~ ipx ) 

(2tt)§ (2 E p )h 


where E p = + (p 


and b p annihilate 


m 2 )?. Here the operators a p 
two different types of particle. They satisfy the commutation relations 
[d P ,dq] = b p ,E q = S (3 > (p — q ) and all others combinations vanish. 
Substituting these into the Hamiltonian we find that, after some algebra 
and then (Step V) normal ordering, that we obtain: 



( 12 . 6 ) 


where n' p ' > counts the number of a-particles with momentum p and, sim¬ 
ilarly, h p counts the number of 6-particles. We notice that the a and b 
particles have the same energy E p , and so we interpret a and b as par¬ 
ticles and antiparticles respectively. This is justified in the next section. 
Thus if(x) involves a sum over all momenta of operators that annihilate 
particles (a p ) and create antiparticles (E p ). We therefore understand 
these new operators as follows: 


• a) p is the creation operator for a particle with momentum p, 

• w p is the creation operator for an antiparticle with momentum p. 

We’ve quantized the theory and we’ve found that the excitations of the 
field are scalar particles and antiparticles. 

It is worthwhile justifying how we have been able to treat if and if ^ 
as if they are independent fields. We began with 


V’(z) = 


and if\x) =-^[(f^x) - i<f 2 (x)\, (12.7) 
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and this implies that 


Oi 


—={if + if ] ] and (j> 2 
v 2 


72 


bP - 7 ]. 


( 12 . 8 ) 


Thus all we have done in moving from a description based on <p i and </>2 
to one based on if and if' is to perform a change of basis. Allowing the 
action S to vary with respect to if and -i/d gives as much freedom as if 
we were varying <fi and </> 2 - For example, we can adjust </>i alone using 
Sip = Sif t and we can adjust <p 2 alone using Sip = —Sift. 


12.2 Noether’s current for complex scalar 
field theory 

The complex scalar field has an internal 17(1) symmetry. 4 This means 
that the global transformations of the fields 

ip —> e ia if, ipi —> e~ ia if lf (12.9) 

have no effect on the Lagrangian. Remember that Noether’s theorem 
says that every symmetry begets a conserved current. This is our first 
opportunity to see what is conserved for an internal symmetry. In fact, 
we will see that the conserved Noether charge is particle number. 

To get to Noether’s current for the internal t/(l) symmetry, it’s easiest 
to write out the transformation for an infinitesimal change in the phase 
a: 

if -> if + iif 6a, Dip =+iip, . . 

V>t _► V’t - i ipUa, Dip 1 = -i If 1 '. 1 ’ 

Note that DC = 0 and since DC = this implies that we can take 5 

W IJ; = 0. The Noether current is therefore given by j£ = n^Ucr, 
(because W 11 = 0) where the index a again labels the component of the 
field: in this case whether it’s a = if or if'. As a result we have 

•7 = J2 U ^D<J = U^Dif + U^D^ 

(J 

= i [(9^1) if — (d^if)if^~\ . (12.11) 

We may make this a Noether current operator by upgrading all of our 
fields into field operators. This creates an ordering ambiguity in writing 
the fields and their derivatives. We will remove this ambiguity by normal 
ordering. Substituting the mode expansion for the fields allows us to find 
the conserved charge operator Qn- 


Example 12.1 

The charge operator is 

Q n = J d^jg = i [(d 0 ^) $ ~ (d°if)^] . 


^ Internal symmetries are discussed in 
more detail in the next chapter. 


c 

The property = 0 turns out to be 
true of all internal symmetries. 


( 12 . 12 ) 
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Of course, there’s the ordering ambiguity here, but we’ll press on and insert the mode 
expansion to yield 

Qn = \ f d3 P ( _a p“p + Mp - “p“p + ~ b pi>p) ■ (12.13) 

Finally, we normal order to give a conserved charge of 

N i^ N ] = / d3p ( b P bp ~ S p“p) ' (12.14) 


The conserved charge is given by the difference in the number of an¬ 
tiparticles Up' 1 and the number of particles n£\ This reason for this is 
quite simple. Particles, which are excitations in the field, carry Noether 
charge of one sign. To have a conserved charge we need other sorts of 
particle excitation carrying negative charge to cancel out the contribu¬ 
tion from the particles. We are left with the conclusion that the reason 
for the existence of antiparticles is that charge wouldn’t be conserved if 
they didn’t exist. 

Obviously, if a current is conserved, then so is — and so our 
assignment of which excitations carry positive Noether charge and which 
are negative is quite arbitrary. It is only when we attach some observable 
property, such as electric charge q, to each that we decide which is which. 
By convention, the number current J^ c is defined to be positive for 
particles (created by a' p operators) and negative for antiparticles (created 

by w p operators) and so we will define 6 j£ c = —N J[ I) , Qn c = / d 3 x J^ c 
and therefore Qn c = — AT[Qn]- Hence using eqn 12.14 and recalling that 
fip^ = ajptip and hp ' = b\,b p , we can conclude that 

Qnc = J d 3 P (n^ - n ( p b) J . (12.15) 

12.3 Complex scalar field theory in the 
non-relativistic limit 

In the final part of this chapter, we will road test our new complex scalar 
field theory and see how it works out in the non-relativistic limit and 
whether or not it will regenerate the familiar results of non-relativistic 
quantum mechanics. 

In the non-relativistic domain the excitation energies of particles are 
small compared to the particle mass contribution me 2 , where here we’ll 
temporarily reinstate factors of c and h. This means that the mass term 
E p= o = me 2 provides by far the largest fraction of the energy of an 
excitation, that is E = me 2 + e, where s is small. To make a theory 
non-relativistic, the strategy is to replace the relativistic held <j> with 

-» ^(x,t)e-' mc2t/h , 

allowing us to factor out the enormous rest energy. 


Noether’s theorem: J7(l) internal 
symmetry 

Dr/; = h/> Dip^ = —i r/4 

n^ t = a^v 

DC = 0 W* = 0 

= j [(a'> t )</’ — (a^vdvd] 

Qnc = — n [q n ] 

= f d 3 p(h^ - rip •*). 


6 With a subscript c for conventional, 
just like conventional current direction 
in circuit theory! 


( 12 . 16 ) 
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Example 12.2 

To get an idea of how this works, we’ll take the non-relativistic limit of the Klein- 
Gordon equation. We start (in natural units) with 

(d 2 + m 2 )*(x, = 0, (12.17) 


and moving to unnatural units we have 

a 2 


h 2 —- - h 2 c 2 V 2 + m 2 c 4 ) ^(as, l)e - 
dt 2 


2 t/n _ 


= 0 . 


The first term on the left yields 

h 2 ^-<K(x,t)e- imc2t / h = h 2 ( 
dt 2 V 


d 2 ^ 


2ime 2 imc 2 t/ ft 

h dt h 2 ) 


(12.18) 

(12.19) 


and substitution gives us 

h 2d -^r - 2imc 2 h— - h 2 c 2 V 2 ^ = 0. (12.20) 

dt 2 dt 

Lastly we notice that the first term, with its lack of factors of c 2 , will be much smaller 
than the other two. We can therefore drop the first term and we end up with 

= * (x,t). (12.21) 

dt 2m 

The result is, of course, that we recover the Schrodinger equation for a free particle. 


To take the non-relativistic limit of the complex scalar field Lagrangian 7 
we substitute in ib = —! = c -lrnt 'I' (with an extra normalization factor 

V 2m v 

1/ \J2m added with malice aforethought). 


7 

We’ll use the full interacting form 
C = {x)df X 'ip(x) — m 2 ^ (x)'ijj(x) — 
g[ip^(x)ip(x)] 2 and return to our con¬ 
vention that h = c= 1. 


Example 12.3 

As in the last example, the guts of taking the limit may be found in the time deriva¬ 
tives. We find 

= T- [do’^do'P - i m (Vdo'f - (d 0 ^ t )^) -fm 2 ^^] . (12.22) 

The first term going as 1/m is negligible in comparison with the others. The third 
term cancels against the mass term in the original Lagrangian. The dynamic part 
of the theory is therefore contained in the second term. We can see that a time 
derivative of (which we may expand in plane wave modes as something like 
^2 a p e~ ip ' x ) will bring down a factor —i E p . Since \ld is obtained from \I/ through 
taking an adjoint, the time derivative of will therefore involve bringing down a 
factor i E p . The result is that ('iddo'I' — may be replaced 8 by — 2'lddo'L. 


After these manipulations, we end up with 


^We could have replaced this with 
2'L(do'L^) if we had chosen modes vary¬ 
ing as e+ ip ' x , but we choose to favour 
matter over antimatter with our choice. 


£ = • Vtf(s) - ^(aOtffa)] 2 , (12.23) 

2m 2 


with g = A/2 m 2 . This looks a lot less neat and covariant than the 
relativistic form, because of the asymmetry between the matter and an¬ 
timatter fields that resulted from the choice we made in simplifying the 
time derivative term. This is acceptable because life is a lot less sym¬ 
metrical in the non-relativistic world, but we will see the consequences 
later. Anyway, we will now continue and feed this Lagrangian into the 
canonical quantization machine. 
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In this example we begin to see some 
of the limitations of canonical quantiza¬ 
tion. For a general potential V(x) the 
procedure does not return a diagonal¬ 
ized Hamiltonian (although it will work 
for a constant potential V(x) = Vo). 
We will see in Chapter 16 how more 
general potentials may be dealt with us¬ 
ing perturbation theory. 


Example 12.4 


Let’s attempt to quantize the non-relativistic Lagrangian, without the self-interaction 
(g = 0), but in the presence of an external potential V(x) which enters as VTT'L. 
There’s nothing for it but to press on with the five point programme. 

Step I: The Lagrangian density with the external potential is given by 

C = i'£ t (x)a 0 'I r (x)-— WLx) • V’P(x) - VYad’S't (x)'I'(x). (12.24) 

2m 

Feeding this through the Euler-Lagrange equations yields the Schrodinger equation 
and, for V(x) = 0, a dispersion E p = (Exercise 12.5). This only has positive 
energy solutions so we might expect that the negative frequency part of the mode 
expansion won’t be needed here. This will turn out to be the case. 

Step II: We calculate the momentum densities thus: 

n % = a(&) = n *t = = °- ( 12 - 25 ) 

Notice that there’s no momentum density conjugate to the field T', which is the 
consequence of our choice in Example 12.3. The momentum allows us to calculate 
the Hamiltonian density: 


n = - c 

= — V’P'Lx) ■ V$(x) + V(x)'I' t (a;)4'(a:). 

2 m 


(12.26) 


Lo and behold, we have obtained a Schrodinger-like equation for the Hamiltonian 
density. 

Step III: The equal-time commutation relation between the position and momen¬ 
tum operators is 

[$(*,*), ft&(t,f/)] = i S&Hx-y), (12.27) 

^(t, cc), 4^(i, y)J = S^\x — y), 

where the second equality follows from inserting our expression n^, = iT^. 

Step IV: A mode expansion with positive and negative frequency parts won’t 
obey the commutation relation above. The mode expansion which does the trick is 
simply 

d3 P - 


(a:) = 


/ 


(2tt): 


(12.28) 


with E v = h{ - 


Step V: Substituting the mode expansion gives us a Hamiltonian operator 

(^{t,x)e-'< E p- E ^ t e i ^~ q)x a\,a q ^ 


H = 


h 3p (£n^) + ! 


d 3 # d 3 pd 3 q 
(27r) 3 


(12.29) 

The time-dependent part will guarantee conservation of energy. It forces the potential 
to have a time dependence of the form V(t,x) = e 1 ( E P~ E ^ t V(x) if the momentum 
of the particle is going to change. With this constraint we obtain a Hamiltonian 

+ / d3 P d3 1 {yip- - 

where V(p—q) = f d 3 Xj^j^V(x)e 1 ^ p ~ q ^' x and the potential must impart an energy 
Ep—Eq . Notice that this is equivalent to the Hamiltonian we argued for in Chapter 4. 


H = 


f 


d 3 P 


(12.30) 


We can look at the non-relativistic complex scalar field in another way 
by considering a different form for the field. We first note that the non- 
relativistic form of the Lagrangian is still invariant with respect to global 
(7(1) transformations. Motivated by this, we write 


*(*) = ^/wy 0[x \ 


(12.31) 
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so that \I/( 2 ;) is written in terms of amplitude and phase. The U(l) 
transformation is now enacted by 8 —> 9 + a (see the box for the con¬ 
served charges). Whereas before we had two fields, (f>i(x) and <j> 2 {x), 
now we have two fields, p{x) and 9{x). Substituting in the Lagrangian 
in eqn 12.23, we obtain (Step I) the Lagrangian in terms of p and 9 
fields 




1 

4 p 


(Vp) 2 + p(V0) 2 ) 


9 2 
2 P • 


(12.32) 


Noether’s theorem: 
symmetry 

U( 1) internal 

DO = 1 

n °e = -p 

ni = ^- 8*0 

0 m 

DC = 0 

lye = 0 

(5) 

o. 

1 

II 

s? 

J N = --2-V0 

m 

<?Nc = f d 3 xp(x) 

J Nc = JL V6» 


Now we turn off interactions (by setting g = 0) and feed the resulting 
Lagrangian into the canonical quantization machine and make selective 
use of its features. First we find the momenta (Step II): 


n °(a:) = — =1 

pK ’ d(d 0 p(x)) 2’ 

ctC 

nSW = WW) = '' 1 ' 1 ’ (1233) 


from which we see that the Ilg is most interesting since it’s not a con¬ 
stant. Let’s make this quantum mechanical (Step III) by imposing 
commutation relations 




9{x,t),p{y,t) 


iS^(x — y). (12.34) 


The box shows that the time-like component of the conserved current 
is p{x). We therefore define the total number of particles as N(t) = 
f d 3 a; p(x, t) and by integrating 9 the commutation relation equation we 
obtain 


N(t),6(x,t) 


(12.35) 


which is an important result. 10 It tells us that in the sort of coherent 
condensed matter system that can be described by the non-relativistic 
complex scalar field theory the operator for the number of excitations 
is conjugate to the operator for the phase angle of the field. We will 
return to this important idea when we come to discuss superfluids and 
superconductors. 


9 This calculation simply uses 

J d 3 y[p{y,t),0(x,t)\ = [N(t),0(x,t)] 
and 

J d 3 j/(5 (3) (a; - y) = 1. 

^Equation 12.35 is called the number- 
phase uncertainty relation. 


Chapter summary 

• Quantization of the complex scalar field leads to two types of ex¬ 
citation: particles and antiparticles. The conserved charge is the 
number of particles minus the number of antiparticles. 

• The non-relativistic limit of the theory may be taken. From this 
we can derive a number-phase uncertainty relation. 
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Exercises 


(12.1) Fill in the missing steps of the algebra that led to 
eqn 12.6. 

(12.2) Evaluate (a) [ip(x), and (b) ['F(ir), 4A(?/)] us¬ 

ing the appropriate mode expansions in the text. 

(12.3) Consider the Lagrangian for two scalar fields from 
Section 7.5. 

(a) Evaluate [Qn, 0 i] and (b) [Qn,02 ], where Qn is 
the Noether charge. 

Hint: You could do this by brute force and evalu¬ 
ate Q n and then find the commutator. A preferable 
method is just to use: [Qn,^] = —i D<j>i from Chap¬ 
ter 10. 

(c) Use these results to show that [Qn,V’] = ip- 


(12.4) Consider the theory described by the Lagrangian 
in eqn 12.32. Use Noether’s theorem in the form 
[Qn,</>] = —i Dtp to provide an alternative deriva¬ 
tion of eqn 12.35. 

(12.5) Apply the Euler-Lagrange equations to eqn 12.24 
and show that it yields E p = p 2 /2m when V = 0. 

(12.6) Find the Noether current for the Lagrangian in 
eqn 12.24. Check that it is what you expect for 
non-relativistic quantum mechanics. 

(12.7) Consider the complex scalar field again. The inter¬ 

nal transformation operator may be written U(a) = 
giQNc°y where Qn c is the conserved number charge 
operator. Show that (a)'ip(x)U (a) = e la ip{x). 



Fields with many 
components and massive 
electromagnetism 



In the previous chapter we described a theory with two field components. 
Here we expand the discussion to theories with more components and 
we will see that the interesting thing about these components is their 
symmetries. We’ll examine two cases, the first of which has an internal 
symmetry and a second where the theory is described in terms of four- 
vectors. 
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13.1 Internal symmetries 


When we find families of closely-related particles in Nature, there is 
an inevitable lurking suspicion that these particles are really a single 
physical entity but that there is some internal dial which can be rotated 
to turn one member of the family into another. The invariance of the 
Lagrangian with respect to turns on this internal dial is described as an 
internal symmetry, quite unrelated to the symmetry of spacetime in 
which the particles live, move and have their being. 1 

An early example of the idea of an internal symmetry was isospin. 
The neutron and proton have almost the same mass (m p ss ?n n ) and this 
led Heisenberg to wonder if they were the same particle in two different 
states with some kind of internal symmetry that he termed isospin. The 
idea is that neutrons and protons both have isospin / = | but that, as for 
conventional spin, there are two possible eigenvalues of the I z operator: 
I z = 1/2 (the proton) and I z = —1/2 (the neutron). By analogy with 
spin angular momentum, we may arrange these into a two-component 

object y n ^ ’ known as an isospin doublet. We can rotate these isospin 

doublets with the same matrices 3 that rotate spins-1/2. Before we get 
too carried away, we should remember that, strictly, m v ^ m n so that 
isospin is only an approximate description. 4 

The idea of an internal symmetry nevertheless has wide applicability. 
We will explore this idea using the example of three scalar particles. We 
imagine arranging them into a vector (i, d , h) which transforms internally 
according to the three-dimensional rotation group SO( 3). Rotating the 
internal dial by 90° about the h-axis turns a t particle, denoted (1, 0,0), 
into a d particle (0,1,0) (see Fig. 13.1). This also implies that super- 


1 We saw the consequence of a t/(l) in- 
ternal symmetry in the complex scalar 
field in the last chapter. 


o 

Note that isospin has nothing to do 
with the actual spin angular momen¬ 
tum of the particles. 


^ As explored in Chapter 37, these rota¬ 
tions form a representation of the group 
SU(2). 

4 Of course protons and neutrons are ac¬ 
tually each composed of quarks. 
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Fig. 13.1 In our toy model, particles 
can be imagined as having an internal 
dial. Whether the dial points along 
(1,0,0), (0,1,0) or (0,0,1) determines 
which kind of particle it is. The po¬ 
sition of the dial is not related to the 
directions in spacetime through which 
particles move, but expresses a direc¬ 
tion in abstract space describing the in¬ 
ternal symmetry. 


positions of particles, obtained by some arbitrary rotation of the dial, 
have just as much validity as the original particles do. Since particles 
are excitations in fields, we should be able to examine isospin in field 
theory. In the field case we arrange the fields that generate these par¬ 
ticles into a vector such as (0i,02,03)- Subjecting this column vector 
to internal rotations by turning the internal dial, one can carry out the 
equivalent act of turning a 0i field into a <j >2 field by rotating about 
the 03-axis. If the Lagrangian describing the theory of these particles is 
invariant with respect to these rotations, then Noether’s theorem gives 
us a conservation law describing the charges of the fields. 

To simplify our notation, let’s write the three scalar fields as a column 
vector <I>(:r) as follows 

( M x ) \ 

$(z) = Mx) ■ (13.1) 

V Mx) ) 

The free Lagrangian (step I of the Canonical quantization process) for 
this theory is given by 

i 77} ^ 

C = -{d^)-{d^) (13.2) 


which is short for 

£ = \ [(<9p0i) 2 - m 2 0 2 + (d^fo) 2 - m 2 (j >2 + (<9^03) 2 - m 2 03] , (13.3) 


that is, the sum of the Lagrangians for the individual fields. Note care¬ 
fully that the object <&(#) is not a vector field that lives in Minkowski 
space like the vectors x M and p^ do. This point is examined in the 
following example. 


Example 13.1 

Here, unlike elsewhere, the dot products aren’t short for a sum g /JfV A IJ, A L '. The 
vector # doesn’t live in Minkowski space at all, so has no dealings with the tensor 
g^ Instead, the dot products here are short for 

$ • $ = + <t>3<j>3, (13.4) 

and 

<9 M $ ■ (13.5) 

This is to say that the a in i&a isn’t a tensor index and there is consequently no 
difference between T' 1 and f Iv,. 


Our example has an SO( 3) internal symmetry: we can transform the 
vector $ using a three-dimensional rotation matrix and the Lagrangian 
won’t change one iota. That is to say that there is no effect on the 
Lagrangian on transforming to a field & via a transformation such as a 
rotation about the 0 3 -axis: 

/ 0j \ / cos 0 — sin 6 0 \ / 0i \ 

( 02 I = ( sin 0 cos 9 0 J | 02 ] . (13.6) 

V 03 ) V 0 0 1 / V ) 
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The canonical quantization for this theory proceeds as before, but we 
now implement our new notation. This leads to a Hamiltonian (step 

II) 


\{d^ a ) 2 + \(yKf + 


^ = E 

a 

The equal-time commutation relations (step III) become 
[$ a (t,®),n^(t,y)l = i6 (3) (x - y)8 a p, 


Note that n££ = . 

“ d(d^<Z> a ) 


(13.7) 


(13.8) 


including a Kronecker delta because each component of 3? has a nonzero 
commutator with the same component of its own momentum field only. 
The mode expansion (step IV) is then given by 


<fr(x) 


d 3 p 1 
(2 tt)§ (2 E p )h 


( a pl e~ ipx + aL&v* \ 

a P 2e~ ip ' x + sLp> x , 
V a p3 e~ ip - x + a\ 3 e^ x ) 


(13.9) 


where a pa are the annihilation operators for the a field and we have the 
commutators [a pon a q p\ = S^ 3 \p — q)8 a p and all others vanish. 


Example 13.2 

We could also abbreviate the notation slightly by writing this as 



d 3 p 1 
(2ir)i (2E p )2 


y. 


(13.10) 


where 



(13.11) 


are internal vectors that tell us about the polarization of the field in its internal space. 


Substituting into the Hamiltonian for this theory then leads 5 to the 
result that 


H = 


d 3 p E p E ®pQ & pa i 

a —1 


(13.12) 


c 

One needs to substitute the mode ex¬ 
pansion and normal ordering steps IV 
and V. 


and we observe that the energy of the particles may be found by summing 
over all momenta and polarizations. 


Example 13.3 

Noether’s theorem reminds us that a symmetry gives us a conserved quantity, so we 
examine the internal SO (3) symmetry of our fields. The Lagrangian is symmetric 
with respect to the transformation — 6 x 3>. For example (and assuming 

sums over repeated indices), rotations about the ^ 3 -axis give < h a —» <J> a — £ a3c # 3 <|> c 
and we have 


Noether’s theorem: SO( 3) inter¬ 
nal symmetry 

]jb^a _ _ £ abcfyc 

DC = 0 = 0 

Qnc = /d 3 a;(# x <9 0 $) 

N [Onc] = ~i f d 3 pe abc a f bp a cp . 


D 3 ^ 1 = <t> 2 , D 3 4 > 2 = 


■(f ) 1 , D 3 <f> 3 = 0, 


(13.13) 
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^We dispense with the normal ordering 
symbol from this point. It should be 
assumed in all calculations of energies 
and conserved charges. 


The theory described in this section 
is rather like quantum electrodynam¬ 
ics but, it turns out, without the inter¬ 
esting complication of gauge invariance, 
which we leave until the next chapter. 

^This is in contrast to our example of 
the internal 50(3) field <!>(#). 


^ Equation 13.18 can of course be writ¬ 
ten out purely in terms of A^ as 

d^ (0 fX A u - d v A») + m 2 A u = 0. 


where we call the symmetry from rotation around the 6-axis D b & a . Using the fact 
that, as usual, = 0 for this internal transformation (since DC = 0), this yields a 
Noether current for rotation around the internal 3-axis: 

j3m = n a MD 3^a = (d^ 1 )^ 2 - (d^ 2 )^ 1 . (13.14) 

Swapping signs over, the conserved charge is given in terms of (normally ordered 6 ) 
creation and annihilation operators by 



Rotations about the 1- and 2-axes will lead to similar conserved charges in terms of 
the other field operators. In general the conserved charge will be given by 

Q Nc = /d 3 i($xW) and Q^ c = ~ifd 3 pe abc dl p d C p. (13.16) 

This conserved charge is isospin. As shown in Exercise 13.1, isospin has the structure 
of angular momentum: in this case an isospin-1. Remember that, despite the similar¬ 
ity between the angular momentum in Exercise 3.3 (which arises since the symmetry 
is with respect to spatial rotations) and the conserved charge in this example, the 
symmetry here is internal and so this angular momentum-like charge is internal to 
the field. 


13.2 Massive electromagnetism 

We now turn to the interesting case of the massive vector field. This 
theory describes an intriguing world' in which photons have mass! The 
theory is described in terms of A^(x) which is called a ‘vector field’ 
because, when we input a spacetime point x, the field outputs a four- 
vector A M corresponding to that point. The key observation about A M 
is that it lives in Minkowski space and so transforms in the same way as 
a four-vector does under the Lorentz transformation. 8 We’ll see that the 
particles created by the quantum version of this field have three possible 
polarizations, which will lead to the excitations carrying a spin of S = 1. 
The Lagrangian for this theory (Step I) is given by 

C = A F ^F^+ l -m 2 A^, (13.17) 

where = d^A v — d„A^. This is just the Lagrangian for the electro¬ 
magnetic field but with a Klein-Gordon-esque mass term on the right- 
hand side. Note that the sign of the second term that contains the factor 
m 2 is positive. In the Klein-Gordon equation the m 2 term was nega¬ 
tive, but the reason for our sign choice here will become apparent in the 
following example. 


Example 13.4 

We can find the equations of motion for the field in this theory by plugging the 
Lagrangian (eqn 13.17) into the Euler—Lagrange equation. The result is that 9 

+ m 2 A" = 0. (13.18) 
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This is known as the Proca equation. Taking the divergence gives us zero from the 
first term (since = 0) and we obtain from the second term m 2 d u A u = 0. 

Since m 2 / 0, we have the additional condition that the field obeys = 0. This 

allows us to write the Proca equations in another form 

(a 2 +m 2 ) A u = 0, (13.19) 

which looks a lot like the Klein-Gordon equation 10 for each of the four components 
of the field A u (x). The dispersion relation is therefore E 2 = p 2 + m 2 for each 
component. 


Alexandru Proca (1897-1955) devel¬ 
oped the theory of massive spin-1 par¬ 
ticles using field theory. 


1(1 This shows that we did indeed make 
a sensible sign choice in eqn 13.17. 


We now continue the procedure of quantizing this theory. 11 Finding the 
momentum density 11^ (x) for each field component a is the first part of 
Step II, which is easily done. Extending the definition of H£(x) from 
the last chapter to include the vector index on A", we have: 

dC 

W v {x) = 575 = - 8 P A V . (13.20) 

0\Pij,A v ) 

Note that Tl^ is now a second-rank tensor whose two indices can be 
raised or lowered using the metric tensor g pu . We then have momentum 
densities 

n 00 = 0, II 0i = d l A° - d°A i = E\ (13.21) 

where E is the ‘electric field’ of the theory (a useful name for notational 
simplicity, but that’s all since this is not electromagnetism!). Notice that 
n 00 = 0 means that A 0 has no dynamics of its own. 12 We will eliminate 
it in the next step (and justify this in step IV). We find the Hamiltonian 
density to be 13 

H = l (e 2 + B 2 +m 2 A 2 + E(V ■ E) 2 \ . (13.22) 

2 \ m z ) 

Notice the positive contribution from the mass term, which justifies the 
choice of sign in the Lagrangian in eqn 13.17. 

Next we need some equal-time commutation relations (step III) to 
make all of this quantum mechanical. Clearly we want a way to deal with 
the different components of the vector field which, when we substitute in 
a sensible mode expansion, will yield up a total energy which is the sum 
of the positive energies of all of the particles. Since we have eliminated 
A 0 from the Hamiltonian in eqn 13.22 we only need relations between 
the space-like components (i and j). We choose 


A'(t,x),E J (t,y) 


= ~iS {3 \x - y)g 13 , 


(13.23) 


which has the form we had for previous cases (namely [(f), docf>\ = i5 since 
g lJ = —Sij ) for each spatial component. 

We may now expand the field A M (step IV) in terms of plane 
waves. Since these will look something like A ~ a p e~ lp ' x , the condi¬ 
tion d p A p = 0 implies that p p A p = 0. This tells us that the four-vector 
A p is orthogonal to p p , just as we have in three dimensions if c • d = 0 
for vectors c and d. Moreover, the condition p^A 1 * = 0 introduces a con¬ 
straint on the components of A p : they are no longer independent and 


11 The rest of this chapter works care¬ 
fully through the nuts and bolts of get¬ 
ting from the Lagrangian in eqn 13.17 
to the final result of the Hamil¬ 
tonian in eqn 13.31, which is our 
familiar second-quantized harmonic- 
oscillator type Hamiltonian. 


12 

This lack of dynamics might cause us 
to worry that the quantization routine 
won’t work on this field component. In 
fact, we will eliminate A 0 and so this 
lack of a momentum won’t turn out to 
be a problem. This is quite unlike the 
case of QED where the problem of hav¬ 
ing II 00 = 0 is a knotty one. See Aitchi- 
son and Hey, Section 7.3 for a discus¬ 
sion. 

13 

In four steps: 

I. Write 

H = E A--(E 2 - B 2 ) 

2 

II. Use eqn 13.21 along with 

E ■ VA° = V • (EA°) - A°( V • E). 

III. Then use the equations of motion 
(eqn 13.18) to eliminate A 0 via 


IV. The Hamiltonian includes a term 
V • (EA°), but this is a total deriva¬ 
tive. We constrain all of our fields to 
vanish at infinity, so we may drop the 
total derivative from the Hamiltonian 
density. 
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^The polarization of the vector parti¬ 
cle tells us about its spin angular mo¬ 
mentum. Vector particles have S = 1 
and the three degrees of freedom cor¬ 
respond to S z = 1,0 and —1. This is 
examined further in the next chapter. 


15 

Unlike y , the label A on the polar¬ 
ization vector is not a tensor index, it’s 
part of the name that tells us the polar¬ 
ization vector to which we’re referring 
We could equally have called the vec¬ 
tors Tom, Dick and Harry. However, 
as we have done here, it’s often useful 
to use A as a shorthand to also tell us 
along which direction the polarization 
vector points in the rest frame of the 
particle. 


^The commutation relations between 
the creation and annihilation operators 
are 

[ a p>> a qA'j = <5 (3) (p-q)<5AV- (13.25) 


we may express one of the components of A M in terms of the others. The 
theory therefore describes a field (and particles) with three independent 
polarization degrees of freedom. The polarization degrees of freedom 
are taken to be A 1 with i = 1,2,3. The component A 0 is completely 
determined via p ■ A = p°A °, which shows that we were justified in 
eliminating it from eqn 13.22. 

Now for the mode expansion. As in the case of the 50(3) fields which 
we wrote as 3>, we’ll need separate creation and annihilation operators 
for each polarization. Unlike that case, however, we need to multiply 
each by a polarization vector 14 e(((p) which lives in Minkowski space 
and whose components depend on the value of the momentum of the 
particle that we’re considering. 15 

Since there are only three degrees of freedom there will be three po¬ 
larizations A = 1,2, 3. We choose that there is no A = 0 polarization 
[that is, we will force the field to be orthogonal to the O-direction in 
the rest frame of a particle, so we won’t require a time-like polarization 
basis vector e^ =0 (m, 0) = (1,0, 0,0) (see below)]. The resulting mode 
expansion is written 
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or, more compactly 16 


A»{x) 


d 3 p 1 

(27r)f (2 E p )i 


( £ a (P) 


A—1 


Clp\£ 


—i p-x 



(13.26) 


Since p^A^ = 0, we’ll require p^e^(p) = 0, which shows how the polar¬ 
ization vectors depend on the momentum. 


Example 13.5 

The next job is to work out what we want the polarization vectors to be. Being vectors 
and living in Minkowski space, these will have to transform like vectors under the 
Lorentz transformations. It will be a good idea to have these vectors orthonormal so 
we want 


<Up) • £ \'(p) = 9^ £ x* £ y = - s \y> 


(13.27) 
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where the minus sign comes from our metric. 17 Consider a particle in its rest frame, 
where it has momentum p^ = (m, 0,0,0). As stated above, we need p^e\^{p) = 0, 
for all A. That is, we want the polarization vectors in this frame to be normal to this 
p One possible choice is to use linear polarization vectors, given by 



( 0 \ 


( o \ 


( 0 \ 

o 

II 

; 

II 

o' 

jf 

0 

1 

, £3 (m, 0) = 

0 

0 

1 

l 0 ) 

! 1 

l 0 ) 


l 1 / 


(13.28) 


We can now work out the value of ( p ) in an arbitrary frame of reference by boosting 
with Lorentz transformation A(p). For example, a particle moving with momentum 
p z = \p\ along the ^-direction has p^ = ( E p , 0, 0, \p\) which is achieved with a boost 
matrix 



( E p 

0 

0 

IpI \ 

1 

0 

m 

0 

0 

m 

0 

0 

m 

0 


\ IpI 

0 

0 

E p J 


(13.29) 


yielding the polarization vectors 


ei(E p , 0 , 0 , |p|) = 

( 0 \ 
1 

0 

, £2(s p ,o,o, |p|) = 

( 0 \ 
0 

1 

, e 3 (E p , 0 , 0 , |p|) = 

( \p\/ m \ 
0 

0 


l 0 ) 


l 0 ) 


\ Ep/m 
{ 13.30) 


17 

Note that despite A not being a ten¬ 
sor index, the orthogonality condition 
is often written g\y , where is the 
Minkowski tensor. 


Finally, using the mode expansion, 18 the Hamiltonian after normal or- 
dering (step V) is diagonalized in the form 


H = 



(13.31) 


1S The expansion is rather tedious. For 
help see the careful explanation in 
Greiner and Reinhardt, Chapter 6. 


meaning that the total energy is the energy of all particles in all polariza¬ 
tions as we might have expected. Although the complication of having 
three field components is irritating, the algebra involves little more than 
the tedious job of keeping track of some new indices and the polarization 
vectors. 


13.3 Polarizations and projections 

Clearly, the added complexity of the vector field arises because of the 
different polarizations which are possible. In this section we’ll develop 
a tensor toolkit to deal with this feature. 19 

Let’s start with something very simple. To project a three-vector X 
along the direction of a vector p (see Fig. 13.2), then all we have to do is 
to take the scalar product between X and p = p/\p\ (to find the length 
of the component), and then multiply the result by p to point it in the 
correct direction. The projected vector is then Xl where 


19 This section can be skipped on a first 
reading. 





(p ■ X)p 

l_l 0 . > 


(13.32) 


where the ‘L’ superscript stands for ‘longitudinal’. The transverse part 
of X is then Xt = X-Xl. We can write these equations in component 
form as 20 


Fig. 13.2 Projecting a three-vector X 
along the direction of a vector p. 

2 ^Assume summation over repeated in¬ 
dices. 
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Aj' - /’"A ' and Aj r'y'XF (13.33) 

where the longitudinal and transverse projection matrices and 
are given by 


pa = tlL 
L N 2 ’ 


(13.34) 


and 

Pt = ~ (13.35) 

The four-dimensional upgrades of these are immediate: we define pro¬ 
jection tensors (returning to tensor notation) 

Pr = ^f and pp =fr -W (is.se) 

P Z P Z 


21 

If we work in a basis of circular po¬ 
larizations a slightly more complicated 
identity is needed. 

I E [ e ^(p)Cx(p) +^a(-p)^a(-p)] 



(13.37) 

See Maggiore, Chapter 4. 


For our massive spin-1 modes pP represents the energy-momentum four- 
vector, so p 2 = m 2 . If we work in a basis of linear polarizations, the 
transverse projection tensor can be related to the polarization vectors 
via the following equality 21 

3 

£ a m (j?)£a v {p) = ~Pj„- (13.38) 

A—1 

This relationship is proved in the following example: 


Example 13.6 

Method 1: In the rest frame p^ = (m, 0,0,0) and the polarization vectors are given 
by eqn 13.28 and hence we have 


3 

E 

A =1 


( 0 

0 

0 

0 ^ 

0 

1 

0 

0 

0 

0 

1 

0 

V o 

0 

0 

1 ) 


(13.39) 


One can then boost this second-rank tensor using the matrix of eqn 13.29 to give 


3 

E y.{p)^\AP) 

A =1 


a^a % E e\ P (m, 0)e ACT (m,0) 

A —1 

( \p \ 2 0 0 —E p \p\ \ 

1 0 m 2 0 0 

m 2 0 0 m 2 0 

V —E p \p\ 0 0 El 


— —gpv + 


(13.40) 


PnPv 


Method 2: The quantity 5 Za=i £ Au £ Ai/ is a tensor and so it must be built up from 
other tensors. Since the only other tensors we can build from the variables in 
the problem are and p^Pu we’ll assume it has to be a linear combination of 
these: Ag^ u + Bp^p u . For a particle at rest we need A + Bm 2 = 0 in order that 
Ea-_i €A 0 e A 0 — 0, which fixes B = —A/m 2 . Next, for a particle at rest we need the 
diagonal space-like components to equal 1, giving us A = —1. 
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Chapter summary 

• Putting more components into our theory introduces the notion of 
the polarization of our field. The Hamiltonian for a simple non¬ 
interacting Lagrangian again yields harmonic oscillator solutions, 
but this time the energy requires a sum over all momenta and over 
polarizations. 

• We have applied this to a Lagrangian for massive electromag¬ 
netism, 

C = ~F liV Fi" + ^m 2 A p A», 

and have shown that the polarization vectors e M (p) are related to 
the transverse projection tensor. 


Exercises 


(13.1) (a) Show that the conserved charge in eqn 13.16 
may be written 

Qnc = / d 3 p Al JA P , (13.41) 

where A p = (ai p ,a, 2 P , & 3 P ) and J = ( J x , J y , J z ) are 
the spin-1 angular momentum matrices from Chap¬ 
ter 9. 

(b) Use the transformations from Exercise 3.3 to 
find the form of the angular momentum matri¬ 
ces appropriate to express the charge as Q Nc = 
/d 3 pBpJB p where B p = (bi p ,b 0p ,b-i p ). 

(13.2) (a) Confirm that eqn 13.29 is the appropriate ma¬ 
trix to boost a particle along the z-direction. 

(b) Show that the boosted vectors in eqn 13.30 are 
still correctly normalized. 

(c) Consider the circular polarization vectors 

Ce = i,0), C L = -U(0,l,-i,0), 


e^L 3 = (0,0,0,1). Show that these are correctly 
normalized according to g P vC\*e v y = 

(13.3) Show that Pl and Pt are indeed projection opera¬ 
tors by showing that P 2 = P. 

(13.4) The Lagrangian for electromagnetism in vacuo is 

C = Show that this can be rewritten 

as 

C = ^\{d p A v d^A v -dpA^A 11 ), (13.42) 

and hence show that using the transverse projection 
operator, it may be expressed as 

C = ^ P^„d 2 A". (13.43) 

This shows that C only includes the transverse 
components of the field, squaring with the idea of 
electromagnetic waves only representing vibrations 
transverse to the direction of propagation. 
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Nobody ever reads a paper in which someone has done an 
experiment involving photons with the footnote that says 
‘this experiment was done in Coulomb gauge’. 

Sidney Coleman (1937-2007) 


14.1 What is a gauge field? 


1 ‘Gauge’ is an awful bit of terminol¬ 
ogy with which we are unfortunately 
stuck. Einstein’s general relativity 
showed that spacetime geometry has 
a dynamical role and Hermann Weyl 
wondered if the scale of length could 
itself be dynamical, varying through 
spacetime. In this picture, one could 
make a choice of gauge which would 
be a choice of scale-length: metal wire 
comes in different thickness or gauges, 
so the term seemed entirely appropri¬ 
ate. He later adapted his scale argu¬ 
ment to one involving phase, as out¬ 
lined here, but the name ‘gauge’ stuck. 


We continue our discussion of invariances possessed by physical systems 
and arrive at gauge invariance. The idea here is that our field theory may 
admit different configurations of the fields which yield identical observ¬ 
ables. Our physical description thus contains some inherent vagueness 
and so we can make a choice about which particular formulation, out 
of the many possible ones, to adopt in a given situation (this is called 
‘a choice of gauge’). A transformation from one description to another 
is called a gauge transformation, and the underlying invariance is 
called a gauge invariance. 1 Note that gauge invariance is not a sym¬ 
metry. Particles do not carry around a knob called ‘gauge’ that allow 
us to change them into other particles. Gauge invariance is merely a 
statement of our inability to find a unique description of a system. 


Example 14.1 


(i) In electrical theory we have to choose a zero for our potential V (i.e. choose 
which potential we call ‘ground’). This is because we only measure potential 
differences (e.g. with a voltmeter) and so the choice of zero potential is an 
arbitrary one. 


(ii) In quantum mechanics we have to make a choice of what we mean by the zero 
of phase of a wave function. The choice is arbitrary because we only measure 
phase differences (e.g. in an interference experiment). There is nothing stop¬ 
ping one transforming all wave functions according to ip(x) —> -ifi(x)e ia and 
the physics is unchanged. 


(iii) In electromagnetism, the magnetic vector potential A can be transformed 
according to A —> A + Vy, where x( x ) is a function of position, and B = 
V x A will be unaffected (because V x = 0). Thus one has to choose the 
form of A. 
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The preceding example raises some important issues. 

• The potential V and A , and the quantum mechanical phase, all 
possess an intrinsic vagueness in description, and we have to make 
specific choices in each case. We might wonder if the choices are 
connected. It will turn out that they are. 

• We thought about our choice of zero for V and phase as being 
something we make once for our particular description of a particu¬ 
lar problem, but which is valid for the whole Universe. If zero volts 
is set differently in New York and in San Francisco, then problems 
will occur if you run a conducting wire between their respective 
electrical grounds. However, A(x) is chosen with a function x( x ) 
which varies from place to place. Thus we will need to distinguish 
between global (affecting everywhere in the Universe) and local 
(affecting only a particular point) gauge transformations. 

With these points in mind, it is time to start exploring this subtle in¬ 
variance, and by doing so we will shed new light on the origin of elec¬ 
tromagnetism. 

We start with another look at the complex scalar field theory 

C = (d^ip) — m 2 ^^. (14.1) 

Recall from Section 7.6 that this has a U{ 1) symmetry. We can make 
the replacement 

ip(x) —> ' l P{x)e la , (14-2) 

and the Lagrangian (and by extension the equations of motion) aren’t 
changed. The most important point to note about eqn 14.2 is that it’s a 
global transformation: it changes the field ip(x) by the same amount 
at every spacetime point. We say that the theory, given in eqn 14.1, is 
invariant with respect to global U{ 1) transformations. 

It turns out that an incredibly rich seam of physics is revealed if 
one asks what happens if the value of a given in eqn 14.2 depends on 
spacetime. That is, we investigate the transformation 

ip(x) -► i/>{x)e ia ( x \ (14.3) 

where the function a(x) allows us to transform the field by a different 
amount at every point in spacetime. This is a very significant change. 
Your first reaction to this might be to say that surely no theory could 
be invariant with respect to a different change in phase at every point 
in spacetime! Let’s see what actually happens. 


Example 14.2 

The good news is that the mass term mb/U/ ? doesn’t change when we transform 
locally. The bad news is that the term involving the derivatives does. This is because 
the derivative now acts on o(x). In fact, we have 

d^ip(x) -* d fl ip(x)e la ‘ ( - x l 

= e ia( - x) d^lx) + bOUe^MicVaOr) 

= e ta W[^ + i ^a(x)]V>(x). 


(14.4) 
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2 

For the case of QED examined in 
Chapter 39 we will be able to identify q 
as the electromagnetic charge. For now 
we treat it as a parameter. 


Similarly 

d^{x) -► e - ia W [<9^ - id»a(x)\ (a:). (14.5) 

The first term in the Lagrangian becomes 

+ {d^a){d^a)^ i/j, (14.6) 

which is certainly not what we started with. 


As expected, having a depend on x has robbed the theory of its (7(1) 
symmetry. We say that it is not invariant with respect to local (7(1) 
transformations. Undaunted, we’re going to see if we can restore this 
local symmetry. We’ll do this by adding a new field A^(x) 1 designed 
to roll around spacetime cancelling out terms that stop the theory from 
being invariant. The way this happens will depend on the manner in 
which the new field transforms. The way to proceed is to introduce the 
field A^(x) via a new object D M given by 

+ iqA^x). (14.7) 

This new object D M is called the covariant derivative. This will fix 
up the local (7(1) symmetry if we insist that the new field A M transforms 
according to A M —> A M — ^d^a(x). The parameter q is known as the 
coupling strength and will later tell us how strongly the A M field interacts 
with other fields. 2 


Example 14.3 

If il>(x) —► il>(x)e loc ( x l, then d^ip —► (9(iV’) ela + ifd/i®) 1 /’ and so 

= (t?/x + i<UU)7’ —*■ (d M -i/ , ) el “ + + igA M V , e 1 “ - i 

= DA*pe ia ). (14.8) 

This property makes the whole Lagrangian invariant if we replace ordinary derivatives 
by covariant ones: 

C = (D'>) t (TWO - (14.9) 

since now with —> Dn,ipe iot , the first term is invariant. 


Thus we conclude that we need our theory to be invariant with respect 
to two sets of transformations, which must be implemented together: 

-► ip{x)e ia< ~ x \ (14.10) 

A^(x) -> A^x) - ^d^x). (14.11) 

A theory which has a field A^ (x) introduced to produce an invariance 
with respect to local transformations is known as a gauge theory. 
The field A /J, (x) is known as a gauge field. 

We might expect the field A IJ (x) could make a contribution to the La¬ 
grangian itself. It only exists in our description because we’ve invented it 
to satisfy our demand for a locally invariant theory, but if such ambitions 
have any groundings in reality then the field A M should have dynamics of 
its own! As we will see in the next section, it gives us electromagnetism. 
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14.2 Electromagnetism is the simplest 
gauge theory 


What are the possible forms of the part of the Lagrangian describing the 
field A^(x)? It must be a theory for which the Lagrangian is invariant 
under transformations of the form A M (x) —> A^(x) — ^d M a(x). Electro¬ 
magnetism provides an example of such a theory. It is, after all, a theory 
described in terms of a vector field A^(x) = (^(a;), A(x)) which is used 
to form a Lagrangian 

C = -\{dpA v - d v A ll )(& i A v - d v A») - J^A», (14.12) 

from which the equations of motion follow as 


d 2 A" -d v {d li A») = J" m , (14.13) 


which are known as the first two Maxwell equations. 3 

One of the most important observations about this formulation of 
electromagnetism is that neither the Lagrangian nor the equations of 
motion are changed if one makes the swap 

A^x) -> Ap(x) - d^x{x), (14.14) 

which is short for the changes 

V - V-doX, 

A -► A+V\- (14.15) 

This observation, which is known as gauge invariance, means that if A^ 
does the job of describing the electromagnetic fields in some situation 
then so does A^ — d^x- This also means that electromagnetism is a gauge 
theory since we can call x( x ) by the name a(x)/q and the condition in 
eqn 14.11 will clearly be satisfied. 


^ Recall that we wrote these in Chap¬ 
ter 5 as d\F Xu = Jg m (where F Xu = 
d x A u — d u A x ), which is clearly the 
same. More interestingly, it is worth 
considering the role of the Maxwell 
equations in this context. In the next 
section we will quantize the theory 
in terms of electromagnetic particles 
called photons, which are excitations of 
the quantum field A(x). The Maxwell 
equations are the equations of motion 
for A{x) and the closest we have to a 
Schrodinger equation for the photon. 
As it is not possible to construct a prob¬ 
ability density from the field A(x), the 
photons that the quantum field theory 
of electromagnetism describes do not 
have an interpretation in terms of a 
probability amplitude of their spatial 
localization. See Berestetskii, Lifshitz 
and Pitaevskii (Introduction and Chap¬ 
ter 1) for further details. 


Example 14.4 

The theory can also be written in terms of the tensor F^ u = d^A u — d u A^ with a 
Lagrangian written as 

C = ~F^F^. (14.16) 

Gauge invariance amounts to the statement that we can change the field according 
to A^ —» A^ — d^x( x ) an d F^ u is unchanged. 


It may be helpful to think of the freedom of the choice of gauge as a 
choice of language. We can speak French, German or Venusian, but 
we are able to communicate the same underlying message no matter 
what language we speak. If we’re sensible, we can choose x( x ) (that is, 
the language we’re speaking) in such a way as to make whatever we’re 
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^ Actually it’s more usually (and incor¬ 
rectly) known as the Lorentz condition 
due to its misattribution to Hendrik 
Lorentz (1853-1928) rather than to the 
less famous Ludvig Lorenz (1829-1891) 
who used it first. See J. D. Jackson and 
L. B. Okun, Rev. Mod. Phys. 73, 663 
(2001) for details of the history. 


describing as simple as possible. There are several common choices, the 
most famous of which are listed in the box in the margin. 

Let’s try making a choice of x( x )- We’ll try choosing x( x ) i n such a 
way that 

d^{x) = 0 , ( 14 . 17 ) 

which is known as the Lorenz condition or as Lorenz gauge . 4 


Common gauges 

for electromag- 

netism 


Lorenz gauge 

8^ = 0 

Coulomb gauge 

- V- A= 0 

Axial gauge 

A 3 = 0 

Weyl gauge 

A 0 = 0 


Example 14.5 

To do this we write .1 ,, —> .1 ( = A p — d^x- We want 

d»A' p = d p A p - d»d pX = 0 , ( 14 . 18 ) 

which is achieved by setting d 2 x — d M A M . This results in the desired condition 
= 0 . 


Lorenz gauge is useful because, in the absence of a current J p m , it re¬ 
sults in a massless Klein-Gordon equation for each component of the 
electromagnetic field. That is, with d^A'^ = 0, eqn 14.13 becomes 

d 2 A lp = 0, (14.19) 

whose solutions are plane waves of the form A p = e p {p)e~ lp ' x with E p = 
|p|. The Lorenz condition therefore makes electromagnetism resemble 
the case of vector field theory. Recall that there the Lorenz condition 
dfj,A p wasn’t a choice, it was mandated by the mass term m^A^A^ in the 
Lagrangian of that theory. In any case, the result is the same: it reduces 
the number of independent components of A' p from four to three. 

However, this still doesn’t make A' p unique! This is because we can 
make a further shift A'^ —> = A'^ — as long as d 2 £ = 0 (so that 
both A ' and A" satisfy the Lorenz condition). To make A ,,p unique, we 
will choose 

d 0 £ = (14.20) 

which implies Aq = 0. With this further choice, the Lorentz condition 
then reduces to V • A" = 0. This is known as Coulomb gauge and 
further reduces the number of independent field components by one. 
This makes it clear that although the field A p has four components, 
the physics allows only two independent components. This can lead to 
trouble when the Held is quantized. 


c 

Remember that the polarization vec¬ 
tors introduced here will carry the in¬ 
formation about the spin state of the 
photon. 


Example 14.6 

The equations of motion in Lorenz gauge read d 2 A^ = 0, which, with A 0 = 0, 
have plane wave solutions 5 A = ee~ ip x . The equation encoding the Coulomb gauge 
condition, V • A = 0, leads to 


p - A = p • e = 0, 


(14.21) 
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which tells us that the direction of propagation of the wave is perpendicular to the 
polarization. For a wave propagating along z with momentum q ** = 0, 0, |qr|) we 

could have 

e i(«) = 

corresponding to linear polarization. We could choose the waves to have circular 
polarization instead, in which case we could have 

6 r (?) = ^ * j > p ^ J • ( 14 - 23 ) 


£ 2(g) = 


(14.22) 


In order to observe the effects of electromagnetism, the electromagnetic 
field must couple to a matter field. We might wonder how to write down 
this coupling. It turns out that the most simple form may be achieved 
with a simple recipe known as the minimal coupling prescription. 
This simply involves swapping derivatives d^ of the matter field for co¬ 
variant derivatives and is illustrated in the example below. 


Example 14.7 

Consider complex scalar field theory in the presence of an electromagnetic field. The 
Lagrangian is written as the sum of the Lagrangians for the two separate theories: 

C = (14.24) 

To enact the minimal coupling prescription we upgrade derivatives of the matter field 
ijj to covariant derivatives: 

C = 

= (<9 M r/d - + iqAfj.il>) ~ rn 2 ip^il> - ^F^F^v 

= — ^Ffj.jjF^ 1 ' 

+ (^-iqA^^^dfji!)) + i(j(d' i i/'t)A /i 'i/) + q 2 ^ipA 11 A^j . (14.25) 

The final line shows the coupling between the A^ field and the 'ip and 't/d fields. The 
strength of the coupling is set by the coupling strength q, which is also known as 
electromagnetic charge. 


The notion that a gauge field, introduced to guarantee a local symmetry, 
will dictate the form of the coupling, or interactions , in a theory is known 
as the gauge principle and is a philosophy to which we will return 
later. 6 

14.3 Canonical quantization of the 
electromagnetic field 

The previous section reminded us that even though A M has four compo¬ 
nents, only two are needed for a physical description (which makes good 


6 See Chapter 46. 
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7 

The gauge field A^ should not, there¬ 
fore, be confused with the vector field 
discussed in the last chapter. The vec¬ 
tor field was massive and this property 
robs it of the possibility of gauge invari¬ 
ance. It also led to its having three de¬ 
grees of freedom. The massless nature 
of the gauge field leads to its gauge in¬ 
variance and its only having two free 
components. 


^This second method may be found in 
the book by Aitchison and Hey and the 
one by Maggiore. 


9 You should find 

n = -(E 2 + B 2 ) + E VA°. 

2 

The last term is removed using the 
same method as employed in the pre¬ 
vious chapter, noting that V • E = 0 
and that all fields should vanish at in¬ 
finity. 


physical sense because light is a transverse wave and therefore has two 
components ). 7 

Having two redundant components of A M flapping around, reflecting 
the freedom to choose which language that we speak, is something we 
might worry about when we quantize the theory. There are now two 
ways to go. We can fix the gauge from the outset in order to reduce 
the number of degrees of freedom from four to two, making a choice 
of which language we’re going to speak throughout, or we can continue 
and see how far we get before the redundancy comes back to bite us . 8 
We choose the first option in the following example, and working in the 
Coulomb gauge we will canonically quantize electromagnetism simply 
by following the usual five steps. 


Example 14.8 

Step I: The Lagrangian is still 

C = -^F„ U F^ = -i(<9 M A„ - - d v A»), (14.26) 

although we’ll need to remember that our choice of gauge dictates A 0 = 0 and 
V • A = 0. Actually, this will be implemented at step III. 

Step II: We find that the tensor has components 

= ——— = ~(d^A u - d u A^), (14.27) 

d(d^Ajy) 

just as in the case of massive vector field theory. Again, the momentum density of 
the zeroth component n 0() = 0, but we don’t care since we have decided to eliminate 
A 0 = 0. The momentum component conjugate to the ith component of the field is 
n 0i = E l , that is, the electric vector field E(x). The Hamiltonian 9 is then 

H=\(E 2 + B 2 ). (14.28) 

Step III: Next we need commutation relations. Our first guess might be that 
[M a?), fl 01 '(y)J = i£( 3 )(* — y)g^ u , similar to the massive vector field. This won’t 

work here though. Rewriting yields pT(£c), (y)j = —i<5^ 3 ^(cc — y)g which looks 
fine until you take the divergence (with respect to x) of this equation and get 

|^A*(£c), E-* (y)J =i 5^\x — y)d-i. (14.29) 

But this should be zero! (This is because di A x = V • A. = 0.) The problem is that 
we haven’t yet ensured that only transverse components of A. are permitted to enter 
the solution. We need a way of projecting out the unnecessary ones to implement 
the gauge condition. The answer to this problem turns out to necessitate the use of 
our projection tensor P T (see Chapter 13). The commutation relation we need is 
given by 

[*(*),#(„)] = ( 143 °) 

= i - y)> 

which gives zero on the right when you take the divergence (exercise). The symbol 
is known as a transverse delta function due to the inclusion of the transverse 
projection tensor P T . This guarantees that our creation and annihilation operators 
satisfy the required relation [d p \, a^,] = S^\p— q)S\\/. 
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Step IV: Next the mode expansion, which only contains those two polarizations that 
are transverse to the direction of propagation: 10 


A»(x) = 


/ 


d 3 P 


(2tt)2 (2E p )2 X=1 


T E ■ (14.31) 


1 Note that the two polarization vec¬ 
tors obey P/j,€%(p) = 0 and are nor¬ 
malized according to g^e^* (p)e^(p) = 
— — 9\\' • 


Inserting this and (Step V) normally ordering, we obtain the expected final result: 


H = 



(14.32) 


with E p = \p\. We conclude that the excitations of the electromagnetic field are 
photons, which may be observed with two transverse polarizations. Apart from the 
slight complication of making the commutation relations compatible with the choice 
of gauge and the hassle of the indices, there’s nothing hard about the quantization 
of electromagnetism in Coulomb gauge. 


The last example showed us that quantized particles of the electromag¬ 
netic held are photons. These particles have spin S = 1 and come in two 
types: (i' pl |0) and dp 2 |0), corresponding to the two transverse polariza¬ 
tions of the electromagnetic held. Consider a photon propagating along 
the ^-direction with momentum = (|<jr|, 0,0, |q|). If we work in a basis 
of circularly polarized vectors, we may write eJ_ R (g r ) = — ^(0, l,i, 0) 
(corresponding to S z = 1) and ca=l( 9) = ^(0,1, —i, 0) (corresponding 
to S z = —1). There are no photons with S z = 0 as this would corre¬ 
spond to an unphysical longitudinal polarization e x _ 3 (p) = (0,0, 0,1). 
We examine this in Exercise 14.2 and in the following example. 


Example 14.9 

We calculated the rotation matrices J for the vector field in Chapter 9. These also 
function as angular momentum operators for photon states. We will create a photon 
travelling along the ^-direction, so that q^ = (|qr|, 0, 0, |qr|). We write 


|7A> = e A («)“],* |0>- 


(14.33) 


Let’s calculate the helicity 11 of photons with A = R. The helicity operator is h = 
J ■ q/\q\, leading to an operator h = J z . Operating with the angular momentum 
matrix J z suitable for 5 = 1 spins, we have 


^ z |7a=r) 


( 0 

0 

0 

0 ^ 

0 

0 

—i 

0 

0 

i 

0 

0 

V o 

0 

0 

0 ) 


r: 


V « 


\ 

) 


q R 


| 0 > 


+|7a=r)- 


(14.34) 


11 Helicity, the projection of the spin 
along the momentum direction, is ex¬ 
plained in detail in Chapter 36. Since 
the photon does not have a rest frame 
the helicity is the most useful quantity 
to use in talking about its spin. 


We see that the right-circularly polarized photon is in an S z = 1 eigenstate and 
also the h = +1 (positive) eigenstate of helicity. It’s easy to show that the left- 
circularly polarized photon is in the S z = — 1 state and has negative helicity. You 
may confirm that the disallowed polarization ej_ 3 (c/) = (0,0,0,1) would correspond 
to a longitudinally polarized photon. Note that this polarization is allowed for the 
vector particle from Chapter 13 as vector fields have three degrees of freedom 
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Chapter summary 

• A theory which has a field A^ix) introduced to produce an in¬ 
variance with respect to local transformations is known as a gauge 
theory. The field A^(x) is known as a gauge field. Theories must 
be invariant with respect to the transformations: 

ip(x) -> ip(x)e ia ( x \ 

Afj,{x) -> A^x) - d^a(x). 

q 

(14.35) 

• Applied to the Lagrangian for electromagnetism, £ = — \F I1V F IJ ' V , 
we have canonically quantized electromagnetism. 


Exercises 

(14.1) Fill in the algebra leading to eqn 14.32. 

(14.2) A demonstration that the photon has spin-1, with 
only two spin polarizations. 

A photon 7 propagates with momentum = 
(|q|, 0, 0, |q|). Working with a basis where the 
two transverse photon polarizations are e^ = 1 (q) = 
( 0 , 1 , 0 , 0 ) and e^ =2 (q) = ( 0 , 0 , 1 , 0 ), it may be 
shown, using Noether’s theorem, that the opera¬ 
tor S z , whose eigenvalue is the z-component spin 
angular momentum of the photon, obeys the com¬ 
mutation relation 

= K\ =1 *(<l) & U=2 ~ ie A = 2 *(<?)agA=i- 

(14.36) 

(i) Define creation operators for the circular polar¬ 
izations via 

(“Ji + ioJa) . 

b lr = yi (“li - i«q 2 ) • (14.37) 


Show that 

[s z > b U] = 

[s 2 ,&; l ] = -s; L . (mas) 

(ii) Consider the operation of S z on a state \^ q \) = 
&tj 0 ) containing a single photon propagating along 
z: 

S z | 7 a) =^S; a |0>, A = R,L. (14.39) 

Use the results of (i) to argue that the projection of 
the photon spin along its direction of propagation 
must be S z = ±1. 

See Bjorken and Drell Chapter 14 for the full ver¬ 
sion of this argument. 




Discrete transformations 



In Chapter 9 we have explored how quantum fields behave under various 
spacetime translations and rotations. Such transformations are contin¬ 
uous and are thus represented by continuous groups (Lie groups). How¬ 
ever, discrete transformations are also possible, represented this time by 
finite groups, and this chapter examines some of the most commonly 
encountered and important discrete transformations. 


15.1 Charge conjugation 

A particularly interesting discrete transformation is one which changes 
all particles into their antiparticles (see Fig. 15.1). This is accomplished 
by an operator C: this operator flips not only the sign of a particle’s 
charge, but also its lepton number, its hypercharge, and all the ‘charge¬ 
like’ numbers which characterize a particular particle. We can write 

c|p> = Ip), (i5.i) 

where p is the antiparticle of a particle p. Let’s say that p has charge 1 
q. If this charge can be measured by an operator Q then Q\p) = q\p), 
whereas Q\p) = -q\p). Thus CQ\p) = qC\p) = q\p), but QC\p) = Q\p) = 
—q\p}- This implies that Q C = —C Q, or equivalently 

C-'QC = -Q. (15.2) 

For the operator C to exchange particles and antiparticles we need 

C -1 d p C = b p and C _1 6pC = dp. (15.3) 

Since a complex scalar field tp(x) can be written +x) = J p (a p e~ lp ' x + 

b^ p e ip ' x ), we must have 2 CA+C = + . 


Example 15.1 

Since C 2 = I , the eigenvalues of C can only be ±1. Most particles are not eigenstates 
of C, since if they were then C|p) = | p) = ±| p), but this would mean that | p) is the 
same state as | p) and so the particle is its own antiparticle. This is true in some cases, 
specifically for particles which have no quantum charges . 3 In the case of the photon 
( 7 ), it is an eigenstate of C with eigenvalue —1, since if you change all particles to 
their antiparticles, the electromagnetic field reverses (A^ —> —A^). It is also the 
case with the neutral pion 7 r° but here the eigenvalue is +1. This explains why the 
reaction 7 r° —^ 7 + 7 is allowed [the eigenvalues of C go from +1 —> (—1) x (—1)], but 
you cannot have 7r° —>7 + 7 + 7 . 
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Fig. 15.1 The charge conjugation oper¬ 
ator C turns a particle into its antipar¬ 
ticle, reversing its charge and all other 
‘charge-like’ numbers. 

^By ‘charge’ we could mean any prop¬ 
erty like charge, or lepton number, or 
baryon number, etc. These are generi- 
cally called quantum charges. 


2 Note that 

# = f (a), e l P- x + b p e~ ip x ). 

Jp 


3 

In other words for particles which have 
zero electrical charge, zero lepton num¬ 
ber, zero baryon number, etc. 
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Fig. 15.2 A parity operation consists 
of reflection in a mirror followed by a 
7r-rotation about an axis perpendicular 
to the mirror. The mirror reverses a 
single spatial coordinate (the one per¬ 
pendicular to the plane of the mirror), 
the rotation reverses the other two. A 
scalar is unaffected by this transforma¬ 
tion, whereas an ordinary (polar) vec¬ 
tor changes sign. A pseudovector (axial 
vector), such as a vector product of two 
ordinary vectors, is unchanged. 


15.2 Parity 


When you look in a mirror, things look the same but there are subtle 
differences. Five screws in your hand are reflected into five screws, so 
numbers of things (scalars) don’t change. But look carefully and you’ll 
see that the threads in the mirror screws turn in the opposite sense 
to those in the real world. A mirror image has the spatial coordinate 
perpendicular to the surface of the mirror reversed, with the other two 
spatial coordinates left the same. This effect cannot be achieved by a ro¬ 
tation. If you follow a mirror operation by a 180° rotation about an axis 
perpendicular to the mirror, you will have reversed all three coordinates 
and this transformation is known as inversion. This transformation 
can be performed using the parity operator P which acts to invert spa¬ 
tial coordinate, i.e. mapping x —> — x. This means that the position 
operator x will anticommute with the parity operator: 


ceP = —Pi;, 


(15.4) 


or equivalently 


P 1 &P = —x. 


(15.5) 


mirror 7 r 



Fig. 15.3 Electric field E behaves like 
an ordinary (polar) vector, while mag¬ 
netic field B and angular momentum L 
(derived from vector products of polar 
vectors) are pseudovectors (axial vec¬ 
tors). The asterisk indicates a partic¬ 
ular vector in the electric field from a 
point charge, and in the magnetic field 
from a current loop, so that you can 
focus on how a specified vector trans¬ 
forms. 

The group multiplication table for the 
parity operator P and the identity op¬ 
erator I: 


I 


I 

P 


I 

P 


P 


P 

I 


The effect on the momentum coordinates is also p —> —p, and hence 

P _1 pP = —p. (15.6) 

However, the commutation relation [x,p x ] = i will be preserved. Since 
P is a Hermitian operator, and since also it is its own inverse (P 2 = /, 
where / is the identity) then P is also unitary. The parity operation has 
no effect on scalars but reverses vectors (see Fig. 15.2, and for physical 
examples see Fig. 15.3). In fact, there are a class of scalars and vec¬ 
tors for which this is not true: a pseudovector (sometimes called an 
axial vector) is formed by a cross product between two ordinary vec¬ 
tors (which are sometimes called polar vectors) and a pseudoscalar is 
formed via a scalar triple product; these behave oppositely. In summary: 

P (scalar) = scalar P (vector) = —vector 

P (pseudoscalar) = —pseudoscalar P( pseudovector) = pseudovector. 

Since P 2 = I then the group formed by the elements {/, P} under mul¬ 
tiplication is isomorphic to Z 2 , the cyclic group of order 2, pretty much 
the simplest non-trivial group you can imagine. As for C, this means 
the eigenvalues are ±1. Scalars and pseudovectors have parity eigen¬ 
value +1, while vectors and pseudoscalars have parity eigenvalue — 1. 
For example, the photon is an excitation in a massless vector field and 
therefore has intrinsic parity — 1. The pion is described by a pseudoscalar 
field and also has parity —1. We will later find (from using the Dirac 
equation and considering what are called spinor fields) that the parity 
of a fermion is opposite to that of its antiparticle. 

In general, the problem we face is that our quantum fields can be 
scalar-valued fields or vector-valued fields or even more complicated ob- 
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jects. Thus a parity transformation will affect the nature of the field 
itself, as well as acting on the coordinates in which the field is defined. 


Example 15.2 

For now, let’s just keep things simple and focus on a scalar field. If we operate on a 
scalar field then </>(£,*) —» (f)(t,—x). Let’s look at what it does to the creation and 
annihilation operators. We have 

P~ 1 4>(t,x)P = 4>{t,-x) = [ dpe-^+P ^ + ale^ Et+p x \ (15.7) 

Jp 


and this will work if 


P~ 1 a p P = d-p, P' 1 aj ) P = a 1 l p , (15.8) 

which means that in the case of scalar fields the parity operator simply reverses the 
momentum on the creation and annihilation operators. 


15.3 Time reversal 

‘If I could turn back time...’ With the time-reversal operator T you 
can! It maps a scalar field x) —> x) and so while it leaves the 

position operator well alone: 


T 1 xT = x, 


(15.9) 


it reverses momentum coordinates: 


T 1 pT = — p. (15.10) 

This means that the commutation relation = i will only be pre¬ 

served if T _1 iT = —i, and hence T must be antiunitary. 4 The archetypal 
antiunit ary operator is K, the complex conjugation operator and in fact 
you can make a general antiunitary operator by forming the product of 
K and U, where U is a unitary operator. Accordingly we write 

T = UK, (15.11) 


which means that 5 U = TK. 


4 An antiunitary operator has T 2 = —I 
(compare this with a unitary operator 
U which satisfies U 2 = I). Note also 
that the condition 

T 1 iT = -i 

can be written iT = — Ti, meaning that 
i anticommutes with T. This looks non¬ 
sensical at first sight: how can a simple 
number like i anticommute with any¬ 
thing? However, T operates on every¬ 
thing to the right of it, and if there is 
a factor of i there, its sign will be re¬ 
versed. In the simple case that T = K, 
where K is the complex conjugation op¬ 
erator, this condition is easily demon¬ 
strated since 


Example 15.3 

(i) For spinless particles you can take U to be the identity and T = K. This is not so 
surprising since the complex conjugate of the Schrodinger equation 

"V ~ i(15.12) 


T (h/;) = -iT0, 

where ijj is any function. 

5 

By postmultiplying both sides of 
eqn 15.11 by K. 


. dip* _ . dip 
1 ~dt~ ~ 'di-t)’ 


(15.13) 
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and so the combined operation of time reversal and complex conjugation leaves the 
form of the Schrodinger equation invariant. Thus indeed it looks like K and T are 
the same thing. 

(ii) Adding spin complicates things because angular momentum is reversed when time 
reverses (it corresponds in some general sense to things going round in a particular 
sense as a function of time). Thus 

T 1 ST = —S, (15.14) 


^Remember the form of the Pauli spin 
matrices in which only <j y has imagi¬ 
nary components, while cr x and cr z are 
real. 


where S is the spin operator. The operation of complex conjugation is a bit more 
complicated since 6 

K-^K = S x , K-'SyK = -Sy, K~ 1 S Z K = S Z . (15.15) 

An appropriate form for U is U = exp(—i7 tS v ) so that 


T = exp(— i7rS y )K. 


(15.16) 


7 The proof of this goes as follows: T 2 = 
UKUK = UU* but since U is unitary 
lit = (U*) T = U- 1 and so T 2 = 
U(U T ) — 1 = X where X is a diagonal 
matrix containing only phase factors. 
Thus U = XU T and thus U T = UX T = 
UX and thus U = XUX and hence the 
diagonal elements of X are only ±1. 
Thus T 2 = ±1. 


^ We are using the fact that T is antiuni¬ 
tary and this implies Ta\-ij>) = a*T\'tp). 


Hendrik Kramers (1894-1952). 


This operator has the property 7 T 2 = d=l. For a system containing many particles 
with spin, the time-reversal operator can be written 

T = exp(— mSi y )K, (15.17) 

i 

where the product is evaluated over all the particles. Let’s put some flesh on these 
bones. For a single electron, the spin is one-half and so exp (—Hr Sy) will have eigen¬ 
values d=i and so T 2 = — 1. This will also be the case for an odd number of electrons 
but if the number of electrons is even then T 2 = 1. 

These expressions have some interesting consequences for the case in which T 2 = 

— 1 (for an odd number of electrons). For a Hamiltonian 7 ~i that is invariant under 
time-reversal (so that 7 ~i commutes with T), both a state |?/>) and its time-reversed 
version T\ip) will have the same energy. Are they the same state? If they were, then 
T|^) = a\ip) where a is a complex number. But in that case 8 T 2 \ijj) = T ot\ip) = 
a*T\i/j) = |o:| 2 but since we have assumed T 2 = —1 then we would deduce |a| 2 = 

— 1 and that’s not possible since \a\ 2 > 0. We have arrived at a contradiction of our 

initial assumption, so we can deduce that |-0) and are linearly-independent states 

and are known as a Kramers doublet. We have deduced Kramers’ theorem 
which states that the energy levels of a time-reversal invariant system with an odd 
number of electrons are n-fold degenerate where n is even. Essentially the energy 
levels come in pairs of Kramers doublets, and you can only split these pairs by 
introducing a perturbation that breaks time-reversal, such as a magnetic field. 


Example 15.4 

Reversing time t using T and reversing spatial coordinate x using P produces a 
reversal of spacetime x. On a scalar field this can be written 

( PT)- 1 4>(x)(PT) = j,(-x ). (15.18) 

This operation leaves the creation and annihilation operators unchanged: 

(PT)- 1 a p (PT) = a p , (PT)- 1 at(PT) = aJ, (15.19) 

and this occurs because a parity transformation flips all the momenta and the time 
reversal flips them all back. In the mode expansion of the scalar field the only effect 
is to flip the sign of i in the exponentials. It thus acts as an operator to perform a 
complex conjugation of the scalar field. 
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Each of C, P and T are conserved in nrany-particle physics processes, 
but not all. Famously, P is ‘violated’ in the weak interaction. However, 
a quantum field theory that satisfies a fairly minimal set of assump¬ 
tions (ones in which C is Lorentz invariant, local, Hermitian and normal 
ordered) possesses the symmetry CPT. In other words, if you reverse 
spacetime and replace particles by antiparticles, the theory should be 
invariant. The proof of this CPT theorem is (in brief outline only) 
based on showing that (CPT) _1 £(x)(CPT) = £(— x), and hence deduc¬ 
ing that CPT commutes with the Hamiltonian and is hence a symmetry. 9 

A consequence of the CPT theorem is that the mass and lifetime of any 
particle is identical to that of its antiparticle. If experiment ever provides 
conclusive evidence of a deviation from this then CPT symmetry will 
have been shown not to hold in some particular case, casting doubt on 
Lorentz invariance. So far, CPT symmetry has survived the test. 


9 The first part can be deduced using 
the fact that £ is a Lorentz scalar and 
hence any term in it contains terms in 
which all tensor indices are contracted, 
so the terms contain an even number 
of tensor indices and under CPT these 
produce an even number of minus signs. 
More details can be found in the books 
by Itzykson and Zuber, Srednicki and 
Weinberg. 


15.4 Combinations of discrete and 
continuous transformations 


We have previously examined the group of three-dimensional rotations. 
These can be represented by orthogonal 3x3 matrices (let’s call them R) 
but with the condition that the determinant of the matrix is det R = 
+1. The group which describes this is known as SO( 3), the special 
orthogonal group, the ‘special’ being the determinant being +1. These 
rotations are called proper rotations. 

The parity transformation can be represented by a matrix 


/-I 0 0 \ 

0 -1 0 , (15.20) 

0 —lj 

[or, for short, diag(—1, —1, —1)] because it maps (x, y, z) —> (—x, —y, —z) 
and this matrix clearly has determinant —1. By combining this single 
transformation with our group 50(3), we get a much larger group 0(3), 
the group of all orthogonal 3x3 matrices. 10 This includes the proper 
rotations of 50(3) (with detR = +1) as well as improper rotations 
in which detR = — 1. Improper rotations are ordinary, proper rotations 
followed by a parity operation (or equivalently by a reflection). 


10 Note that orthogonality implies 
R T R = I. Hence taking the de¬ 
terminant, detR x detR t = 1. 

Now det R = det R T and hence 
(detR) 2 = 1 or detR = ±1, showing 
that 0(3) contains matrices with both 
det R = +1 and det R = —1. 



Example 15.5 


(i) Reflection by a mirror in the x-y plane is accomplished by diag(l, 1, —1). This 
has determinant —1 (an improper rotation). 

(ii) Rotation about the z-axis by n is accomplished by diag(—1, —1,1). This has 
determinant +1 (proper rotation). 

(iii) The parity operation, diag(—1, —1, —1), is accomplished by the product of (i) 
and (ii). It is therefore an improper rotation. 


Fig. 15.4 The group 0(3) consists of 
two disjoint sets, which can be pictured 
as two disconnected islands. These sets 
are distinguished by their elements ei¬ 
ther having determinant +1 [this island 
is 50(3)] and —1. Only the first is¬ 
land is independently a group because 
the identify transformation has deter¬ 
minant +1 and lives exclusively on the 
left island. 
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Fig. 15.5 The Lorentz group 0(1,3) 
consists of four separated components. 
The identity element resides in the 
component SO"*" (1,3), the proper, or- 
thochronous Lorentz subgroup. The 
other ‘islands’ can be reached using the 
operators P, T or the combination PT. 



Fig. 15.6 Closed paths in the space 
SO( 3), represented as a ball in which 
antipodal points on the surface are 
identified, (a) A path from the north 
to south pole cannot be continuously 
deformed to a point, while (b) a double 
path can. 


The group SO( 3) is a connected group, meaning that you can imagine 
continuously wandering through all the elements of the group; the group 
is not the union of two or more disjoint sets. The same cannot be said of 
0(3). It consists of two disjoint sets, one set with determinant +1 [which 
is <S'0(3)] and the other with determinant —1 and is a disconnected 
group (see Fig. 15.4). 

We find something similar when we consider the Lorentz group, the 
group containing all rotations, reflections and Lorentz boosts. This is 
often given the symbol 0(1,3), emphasizing that it has time-like (one¬ 
dimensional) and space-like (three-dimensional) parts. This consists of 
four topologically separated components because as well as P we have 
to consider T. In a four-dimensional representation we have 

P = diag(l,-l,-l,-l), T = diag(-1,1,1,1). (15.21) 

The subgroup of the Lorentz group that doesn’t reverse spatial or tem¬ 
poral coordinates is the proper, orthochronous Lorentz subgroup 
50 + (l,3). This connected group is one of the four components of the 
Lorentz group. The other three components can be accessed by taking 
elements of SO + (1,3) and (i) including P, or (ii) including T, or (iii) 
including both P and T (see Fig. 15.5). 

While wading in these deep topological waters, it is worth making 
some important remarks about simple rotations which will have impor¬ 
tant consequences later. Let’s consider proper rotations which we have 
been representing using the group S'0(3). Let’s think about the topol¬ 
ogy of this. A rotation is characterized by a rotation axis and an angle. 
Thus we could imagine a sphere with radius 7r and then any point in¬ 
side the sphere could represent a rotation: the direction from the origin 
would determine the rotation axis and the distance from the origin would 
determine the rotation angle. However, a rotation of tt about a partic¬ 
ular axis is equivalent to a rotation of —tt about the same axis. Thus 
the topology of SO(3) is equivalent to that of a ball in which antipodal 
points on the surface are identified. 

This topological space is clearly connected but is not simply con¬ 
nected, which means that a closed path through the space cannot be 
continuously shrunk to a point. To show this consider a path running 
from the north pole to the south pole of the sphere [Fig. 15.6(a)]. You 
can’t deform this continuously to a point. If you move the path near 
the north pole to the right, the point near the south pole moves to the 
left. You can’t get the two to join up. However, if you run the path 
twice from the north pole to the south pole, you can do it [Fig. 15.6(b)]. 
This shows that a 47 t rotation is continuously deformable to a point, 
while a 2tt rotation is not. This is the basis of the famous Dirac scis¬ 
sors parlour-game trick, an amusing demonstration in which you thread 
string around the eyes of a pair of scissors and a nearby chair. Two full 
rotations of the scissors leave the string apparently highly tangled but 
with a few deft movements which do not involve further rotations you 
can untangle the string. The string serves to make the trick more com¬ 
plicated than it need be, and a more transparent version can be carried 
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out using a scarf or a belt, with one end fixed by, e.g. placing it under a 
book, as shown in Fig. 15.7(a). The free end is rotated by two full turns 
(i.e. by 720 degrees) in the same sense and looks very twisted, as shown 
in Fig. 15.7(b). It can be untwisted without rotating the free end in the 
opposite sense, simply by passing the free end around the middle of the 
belt and pulling taut, see Fig. 15.7(c)-(f). Try it! 

To show this point up more clearly in group theory, we could also 
use another group to represent rotations: 517(2). This is the special 
unitary group of 2 x 2 matrices, where again the ‘special’ means the 
determinant is one. This can be used to rotate spinors. 


Example 15.6 

To rotate by an angle 0 about an axis defined by ft we can use the rotation matrix 
which we’ve previously written as R (6). Here we will write 6 = 0n , and we will 
employ a useful identity which is 

/ „ „ \ 

(15.22) 

where a = (<j x , cr y , cr z ) are the Pauli spin matrices. Our rotation matrix, which we 
will now write as R(fi, 0), is given by 


.0 


e 


e 


R(n, 0) = exp ( — i — a ■ n ) = I cos-i sin — an. 


(15.23) 


where I is the identity matrix. These matrices have an interesting feature, which is 
that while as you would expect 

R(fi, 0) = /, (15.24) 

you also have 

R(ri, 2n) = -1 , (15.25) 

and you need a full 47 t rotation to recover 

R(ri, 47r) = I. (15.26) 


The previous example shows that 51/(2) is actually a double cover of 
50(3), meaning that for a particular rotation there are two represen¬ 
tations in 51/(2) for every one representation in 50(3). For example, 
the identity (no rotation at all) is just diag(l, 1,1) in 50(3), but is 
represented by both diag(l, 1) and diag(—1, —1) in 517(2). 

A spinor can be written as a two-component entity (^j where a and 

b are complex numbers and |a| 2 + \b\ 2 = 1. Writing a = Xo + iaq and 
b = x 2 + ix 3 where are real we see that this condition is equivalent 
to Xq + x\ + x 2 + x 2 = 1 and so 517(2) is isomorphic to 5 3 , that is 
the 3-sphere. 11 Thus 517(2) is simply connected, in contrast to 50(3). 
The formal relationship between the two groups is 50(3) = 517(2)/Z 2 , 
meaning that 50(3) is a quotient group. 

These arguments can be generalized to the connected component of 
the Lorentz group: 50(1, 3) = SL( 2, C)/Z 2 , where SL( 2, C) is the group 
of 2 x 2 complex matrices with unit determinant. This gives you some 
flavour of the way in which the mathematical structure of the spaces 
describing the spacetime transformations can be described using group 
theory. 



Fig. 15.7 Dirac’s scissor trick is more 
easily demonstrated with a belt with 
one end free and the other end held 
fixed by placing it under a book. [Fig¬ 
ure from S. J. Blundell, Magnetism: A 
Very Short Introduction, OUP (2012).] 


It may be helpful to remember: 

• S' 1 is a circle, a one-dimensional 
space that we normally draw on 
a two-dimensional piece of pa¬ 
per. It can be described with the 
equation x 2 + y 2 = 1. 

• S 2 is a 2-sphere, or more com¬ 
monly just a sphere (meaning 
the surface of a ball), a two- 
dimensional space that can be 
embedded in three-dimensions. 
It can be described with the 
equation x 2 + y 2 + z 2 = 1. 

• S 3 is a 3-sphere, a three- 
dimensional space that can be 
embedded in four-dimensions. It 
can be described with the equa¬ 
tion x 2 + y 2 + z 2 + w 2 = 1. 

For more details on topological nota¬ 
tion, see Chapter 29. 
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circle 

S 1 

(, x °) 2 + (a: 1 ) 2 = 1 [5 1 = £7(1)] 

sphere 

S 2 

(a; 0 ) 2 + (a; 1 ) 2 + (x 2 ) 2 = 1 

n-sphere 

S n 

(a; 0 ) 2 + (a: 1 ) 2 -|-+ (a;") 2 = 1 

n-torus 

rpn 

S 1 x S 1 x • • • x 5 1 (n > 2) 

real n x n matrices 

general linear group 

GL(n, R) 

n x n real matrices 

special linear group 

SL(n , K.) 

GL(n , M) with det M = 1 

orthogonal group 

O(n) 

GL(n, K) with MM T = M T M = I 

special orthogonal group 

SO{n ) 

0(n) with det-M = 1 

Lorentz group 

0(1,3) 

GL(n,M) with MgM T = g 

special Lorentz group 

50(1,3) 

0(1,3) with detM = 1 

complex n x n matrices 

general linear group 

GL{n , C) 

n x n complex matrices 

special linear group 

SL(n, C) 

GL(n, C) with det M = 1 

unitary group 

U(n) 

GL(n, C) with MM+ = M+M = I 

special unitary group 

SU(n) 

U(n) with det M = 1 


Table 15.1 Some useful mathematical objects. 


Chapter summary 

• Discrete symmetries include charge conjugation C (which changes 
all particles into their antiparticles and vice versa), parity P which 
inverts spatial coordinates and time-reversal T which reverses time. 

• The combined symmetry CPT holds for a Lorentz invariant, local, 
Hermitian quantum field theory. 

• For reference, some useful mathematical objects are tabulated in 
Table 15.1. 


Exercises 

(15.1) Why is the reaction 7 r° —> 7 + 7 + 7 not allowed? pseudovector; (e) the scalar product of two vectors; 

(15.2) Classify the following as scalars, pseudoscalars, vec- ( f ) the scalar P roduct of two pseudovectors. 

tors (polar vectors) or pseudovectors (axial vec- (15.3) Find representations for the spinor rotation matri- 
tors): (a) magnetic flux; (b) angular momentum; ces (a) R(a;,0), (b) R (y,0) and (c) R(z,$). 

(c) charge; (d) the scalar product of a vector and a 




Part IV 


Propagators and 
perturbations 

Quantum field theory is particularly challenging when one has to deal 
with interacting systems. One method to calculate results uses perturba¬ 
tion theory and this can be evaluated using various techniques involving 
Green’s functions and Feynman diagrams. This part introduces some of 
these techniques. 

• In Chapter 16 we introduce Green’s functions and show how they 
are related to a propagator, the amplitude that a particle in some 
spacetime position will be later found at another spacetime posi¬ 
tion. 

• We connect Green’s functions to propagators in a quantum field 
picture in Chapter 17 and introduce the Feynman propagator which 
involves time-ordering of operators. This is applied to Yukawa’s 
model of virtual particle exchange. 

• Interactions are added to the picture in Chapter 18 and in partic¬ 
ular the idea of the S-matrix. We work this out in the interaction 
representation, another way of writing down states and operators 
which turns out to be very useful for these problems. These ideas 
are applied to a perturbation expansion and use is made of Wick’s 
theorem which provides a method for simplifying products of op¬ 
erators. 

• In Chapter 19 we introduce Feynman diagrams which provide a 
wonderful pictorial way of visualizing terms in the perturbation 
expansion. Rather than evaluating umpteen integrals we can sum¬ 
marize many complex calculations in diagrammatic form and eval¬ 
uate the results using Feynman rules for a particular situation. 

• These ideas are applied to scattering theory in Chapter 20 for a 

simple model which serves as a toy model of QED and the 

scattering cross-section is defined and then evaluated. 





Propagators and Green’s 
functions 


16.1 What is a Green’s function? 

144 

16.2 Propagators in quantum me¬ 
chanics 146 

16.3 Turning it around: quantum 
mechanics from the propaga¬ 
tor and a first look at pertur¬ 


bation theory 149 

16.4 The many faces of the prop¬ 
agator 151 

Chapter summary 152 

Exercises 152 


^As Feynman notes, quantum ampli¬ 
tudes are like Hebrew in that you read 
them right-to-left. 


2 These are named after George Green 
of Sneinton, Nottinghamshire (1793- 
1841) who was perhaps the greatest 
self-taught mathematical physicist of 
modern times. His achievements are all 
the more remarkable since, until 1829, 
his day job consisted of running a wind¬ 
mill. While still a miller he published 
his most celebrated work ‘An Essay on 
the Application of Mathematical Anal¬ 
ysis to the Theories of Electricity and 
Magnetism’ (1828). The essay included 
the first use of what are now called 
Green’s functions. 


When will the world know that peace and propagation are 
the two most delightful things in it? 

Horace Walpole, 4th Earl of Oxford (1717 1797) 

One way to do quantum mechanics is to calculate a wave function and 
operate on it with quantum operators. Another way is to directly con¬ 
sider amplitudes for a given process, such as ‘the amplitude that my 
particle starts at point y at a time t y and ends up at point x at time 
t x \ which we write down 1 as (x(t x )\y(t y )}. This amplitude is known 
as a propagator. We’ve spent time in previous chapters removing the 
wave function from its role as the most useful entity in quantum physics 
and we’ll see that propagators represent an alternative to wave functions 
and enable us to extract all of the same information. In fact, we’ll see 
that wave functions are actually special cases of propagators and con¬ 
tain less information! For our purposes, propagators represent the most 
economical way to calculate all of the properties of quantum fields in an 
interacting system of many particles. In addition propagators for sin¬ 
gle particles have a neat mathematical property: they are the Green’s 
functions 2 of the equation of motion for a particle. 

The plan for this chapter is as follows: (i) we’ll explain what’s meant 
by a Green’s function; (ii) we’ll demonstrate that propagator amplitudes 
are the Green’s’ functions of quantum mechanics: they tell us the am¬ 
plitudes for a particle to start at a spacetime point ( y,t v ) and then be 
detected at a point (x, t x ); (iii) finally we’ll show that quantum mechani¬ 
cal perturbation theory is conveniently expressed in terms of propagators 
and that these propagators are most conveniently expressed in terms of 
cartoons. 


16.1 What is a Green’s function? 

Here’s a very general looking differential equation formed from a linear 
differential operator L: 

Lx(t) = f(t). (16.1) 

The Green’s function G(t , u) of the linear operator L is defined by the 
equation 


L G(t, u ) = 8{t — u). 


(16.2) 
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Notice that the Green’s function G(£, u) is a function of two variables, t 
and u. 


Example 16.1 

Let’s consider a familiar example from mechanics. We’ll take an oscillator of mass m 
and spring constant K evolving under the influence of a time-dependent force f(t). 
The equation of motion for this is 

d 2 

+ Kx (t) = f(t), (16.3) 

,2 

so here the linear operator is L = m + K. We can build up a force function f(t) 
by adding together lots of delta functions. 

y /*oo 

d uf(u)S{t - u), (16.4) 

0 

where u us a dummy variable which allows us to break f(t) into deltas. This is, of 
course, merely an example of superposition and is allowed because our differential 
equation is linear. 

Instead of hitting the problem head-on, we sidestep and solve eqn 16.3 for just one 
of the delta functions. We’ll call this solution the Green’s function G(t,u ): 


d t 2 


-K 


G(t , u ) = S(t — u). 


(16.5) 


The complete solution to eqn 16.3 is then given by the integral of the Green’s function, 
weighted by the force function f(u): 

y roo 

d uG(t,u)f(u). (16.6) 

o 

The solution is therefore simply a sum of the responses of the system in x to the 
stimuli f(u). 

Why is the solution to eqn 16.3 given by eqn 16.6? Start by acting on eqn 16.6 
with the operator L (which passes through the integral sign on the right-hand side), 

Lx(t) = / duLG(t,u)f(u) 

= f duS(t — u)f(u) by eqn 16.2 
= f(t) by eqn 16.4, (16.7) 


which is where we started! This implies we can solve an inhomogeneous differential 
equation by finding the Green’s function Git. u) and then integrating over f(u) to 
get the solution x(t). 


It’s now clear why the Green’s function G(t, u ) needs two arguments: 
one is the variable we’re interested in (here it’s the time t), the other u 
is the variable that describes the position of the delta function S(t — u) 
that we use to define the Green’s function via LG(t,u) = S(t — u). The 
inhomogeneous part of the equation [here f(t )] is built up from a set 
of delta functions weighted by an amplitude function f(u), that is a 
function of the dummy variable u. 

Now we know what use they are, we can find the Green’s functions 
for some commonly encountered L operators. 3 


3 Remember that <5(a;) = S(—x) so we 
can swap the order of the arguments in 
the delta function for convenience. 
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^(1+1)-dimensional spacetime means 
one spatial dimension and one time 
dimension. We normally think about 
(3+l)-dimensional spacetime with 
three spatial dimensions and one time 
dimension. 

c 

We don’t need to integrate over 
all time because the multiplication 
G(x,t x ,y,t y )(f)(y) does that part au¬ 
tomatically, as in the case of the 
time-evolution operator which, as we’ll 
shortly see, is closely related to the 
Green’s function. 


^With this in mind we call G~*~ the 

time-retarded Green’s function, 

defined as 


G+ = 



tx ^ ty 
tx ^ ty , 


or more simply 

G+ = 0(t x - ty)G. 


(16.14) 


(16.15) 


We can also define the time-advanced 
Green’s function by 


( 0 tx ty 

G t X < ty, 


(16.16) 


or more simply 


G~ = 0{t y — t x )G. (16.17) 


Example 16.2 

Examples of finding some Green’s functions. 

• The Green’s function for L = V 2 can be read off using an argument 
from electromagnetism. We know Poisson’s equation (reverting to SI units): 
V 2 P(a?) = — , which for a point charge of magnitude unity, located at u , 

reads 

e 0 V 2 y(a:) = -S (3) (x - u). (16.8) 

The potential that solves this equation is known from electromagnetism to be 

V(x) = -r-J-r, and so 

v ' 47TC0 \x—u I 7 


V 2 

1 

3 

II 

(16.9) 


_4n\x — u\ _ 

or 


1 


G(a ?, u) = — 

(16.10) 

4tt\x — u\ 

• The Green’s function for L 

= (V 2 + fc 2 )is defined by 


(V 2 + k 2 ) G k (x,u) = <5 (3) (ec — it). 

(16.11) 

The solution which will interest us is 



G fe (x-u) = 

e i\k\\x — u\ 

4n\x — u\ 

(16.12) 


which describes the amplitude of an outgoing, spherical wave. 


16.2 Propagators in quantum mechanics 

We know that the equation that governs the change of a wave func¬ 
tion 4>(x,t) is the Schrodinger equation H(j){x,t) = i, w j lere j s 
an operator function of x and we’ll only consider (1 + l)-dimensional 
spacetime for now. 4 Why might the Green’s function of the Schrodinger 
equation be useful, and what interpretation might such a function have? 

The real beauty of a Green’s function is the property that we had in 
eqn 16.6, namely that 

4>{x,t x ) = J dyG + (x,t x ,y,t y )(j)(y,ty). (16.13) 

(The + sign notation will be explained shortly.) Here the Green’s func¬ 
tion takes a wave function at some time and place and evolves it to 
another time and place. To find out how it does this we need to inte¬ 
grate over space (but not time). 

In fact, we say that the Green’s function propagates the particle from 
the spacetime point ( y,t y ) to (x,t x ), which explains why we call G + a 
propagator. Here the plus-sign superscript in G + means that we con¬ 
strain t x > t y and we define G + = 0 for t x < t y , which prevents particles 
going back in time. 6 

Looking at eqn 16.13 we can interpret </)(y, t v ) as the amplitude to find 
a particle at ( y,t y ) and <j>(x,t x ) as the amplitude to find a particle at 
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(x,t x ). It follows that the propagator G + (x, t x , y, t y ) is the probability 
amplitude that a particle in state | y) at time t y , ends up in a state \x) 
at time t x . The interpretation means that the Green’s function may be 
written 

G + (x,t x ,y,ty) = 9(t x - ty)(x(t x )\y(ty)). ( 16 . 18 ) 

Note that, using this language, our old friend the wave function 
4>(x,t x ) = (x\(j>{t)) is simply the amplitude that a particle is found at 
( x,t x ) irrespective of where it started. The propagator therefore con¬ 
tains more information, since it cares where the particle started. 


Example 16.3 

From the basic definition of the propagator G+(x, t x , y, t y ) = 6(t x ~ ty)( x (tx)\y(ty)) 
we can invoke the time-evolution operator so that the time dependence is taken away 
from the states 

G + (x,t x ,y,t y ) = 9(t x -ty){x\U(t x -ty)\y) 

= 9{t x -ty){x\e- i{l 9*-ty)\ y) . (16.19) 

We can also expand the amplitudes in terms of the eigenstates of the operator H , 
which we call |n), which have eigenvalues E n , 

G + (x,t x ,y,t y ) = 9(t x - t y )(x\e~ 1 * i( - t x- t y')\y) 

= 9{t x -t y )Y^{x\e~ lH(tx ~ tv) \n){n\y). (16.20) 

n 

Remembering that ( x\n) is just a fancy way of writing a wave function (f) n (x ), we see 
that, in general, the propagator may be written in terms of the eigenfunctions of H 
as 

G + {x,t x ,y,ty) = 9{t x - ty)J2 < t , n(x)<t>n(y) e ~ lEn< ' tx - tv ), (16.21) 

n 

allowing us to relate wave functions and propagators. 


So far we’ve assumed the propagator G + (x, t x , y, t y ) is some sort of 
Green’s function of the Schrodinger equation: now we’ll confirm this. 
We define the Green’s function for the Schrodinger equation by consid¬ 
ering G + {x, t x ,y,t y ) as a function of x and t x only and say 


H. r - i 


dt 7 


G (x,t x ,y,t y ) 


-iS(x - y)S(t x - t y ) = -i 5 (2 \x - y), 

( 16 . 22 ) 


where H x only touches ^-coordinates and not ^-coordinates. Here we 
hold y and t y fixed as dummy variables. Finally, we want to confirm 
that the amplitude (x(t x )\y(t y )) is truly the Green’s function of the 
Schrodinger equation. 


Example 16.4 


Here’s how is goes. The Green’s function is given by 


G + (x,t x ,y,t y ) = 9{t x - ty)(x(t x )\y(t y )) = 9(t x - t y ) ^ (y)e lE ^ tx *v\ 

n 

(16.23) 
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where we’ve explicitly included the 6 function to ensure that the particle can’t prop¬ 
agate backward in time. Substituting the Green’s function into eqn 16.22, we find 
that we want to work out 

e(t x - ty ) y M^)K(y>~ lEn{tx ~ ty) - (ie.24) 


H x — i- 


dt x 


Take this in two stages. Stage I: We use the fact that -^—6{t x — t y ) = 5(t x — t y ), 
to find that the time derivative acting on the retarded Green’s function gives us 


<9t 


-G+ 


i<5(ix -ty)'%2<t>n(x)<l>n(y)e lBn( * x l v) 

n 

+6(t x - t y )^2 E n <j> n (x)<t>n(y)e- lEn< ' tx - t (16.25) 


Stage II: Now considering H x acting on the Green’s function, we note that 
H x (j) n {x) = E n (j){x) is the only thing that is affected by the H x operator. We have 


H X G + = HxO(t x — ty)(x(t x )\y(ty)) = 0(t x - ty)y^ i E n <t> n (x)^(y)e 4 a). 

n 

(16.26) 

Putting all of this together, we obtain 


8 1 

dt x _ 


G~^~ ( x , t x , y , ty') 


-iS(tx -ty)Y^<t>n(x)K(y) e lBn(tx *») 

n 

—iS(tx — t y )S(x — y), (16.27) 


which is what we set out to prove. 


Now let’s try to find a propagator for free particles. Our first stop will 
be the free-particle propagator in position space and the time domain, 
which we call G + (x,t x ,y,t y ). 


We use the replacement 



dp 
2t r’ 


Note for later that we will often make 
the replacement in three dimensions 

V J (a *) 3 

where V is the volume. 


Example 16.5 

A non-relativistic, free particle has a Hamiltonian H = p 2 /2m, with eigenfunctions 
4>(x) = ~^=e ipx and eigenvalues E p = E-. Treating p as a continuous variable, we 
start with eqn 16.21, and transforming the sum into an integral 7 we have 

G+(x,t x ,y,t y ) = 9{t x - t y )L J ^</> p (ad0p(t/)e _lB r (tx_t *' ) 

= 0(t x ~ty)f (16.28) 

This integral can be done: it’s a Gaussian integral, of which, much more later (see 

ax 2 /— b 2 

Chapter 23). For now, we use the result dx e 2 ~+ bx = J ^e^: with a = 
1 ^ tx // ty ^ and b = \{x — y). This gives 

I TYL im(x y) 

G+{x, t x ,y, ty) = 0(tx - ty) e ^tx-t v) . (!6.2 9 ) 

V 2m(tx - ty) 


There are more examples of finding propagators in the exercises at the 
end of this chapter. 
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16.3 Turning it around: quantum 

mechanics from the propagator and a 
first look at perturbation theory 

So far we’ve used things we know about quantum mechanics to derive 
some properties of a single-particle propagator. In fact, we want to turn 
this around. If we start from the propagator, what can we learn about 
the particle? This is most easily seen by considering yet another form 
of the Green’s function, again as a function of space but this time in the 
frequency/energy domain. This one looks like 

T?\ ^n{x)(/)n(y) naon\ 

G + {x, y, E) = 2^ E _ E -■ (16.30) 

n n 

(You’re invited to prove this relation in Exercises 16.2.) 

We can notice two things about this equation when we consider it as 
a function of the complex 8 variable E: (i) the singularities (or poles) on 
the real axis occur when E = E n , that is, when the parameter E equals 
the energies of the eigenstates <j> n {x)', and (ii) the residues at the poles 
are (i times) the wave functions. We see that by being able to write 
down the Green’s function of a system we have access to the energies 
of the system and its wave functions. Propagators, with their ‘from 
here to there’ definition, also have the appealing property that they 
can be drawn in a cartoon form showing a particle travelling from y to 
x. This isn’t quite as trivial as it sounds: Bohr was doubtful whether 
the trajectory of a quantum particle could be thought about due to 
position-momentum uncertainty, but it can. The quantum propagator, 
which gives this trajectory some meaning, is illustrated in Fig. 16.1. 

This doesn’t explain what all the fuss is about. Why are Green’s 
functions so useful? One place to look is perturbation theory. In most 
interesting cases we can’t solve a quantum mechanical problem exactly. 
What we do is write the problem by splitting up the Hamiltonian into 
two parts: H = Hq + V, where Hq is the solvable part and V is the 
perturbing potential. Perturbation theory involves the tedious task of 
evaluating changes to eigenfunctions and eigenvalues in a series of in¬ 
creasingly complicated corrections. 

Let’s look again at our Green’s functions and see if they lend any 
insight to the perturbation problem. For simplicity, we’ll write the 
original Green’s function equation in a symbolic, matrix-like form as 
(H — E)G = — l. 10 (Remember throughout that G just describes a par¬ 
ticle propagating from y to x.) The symbolic Green’s function equation 
is solved by writing G = pzp, which bears a resemblance to the form in 
eqn 16.31. 

Green’s functions allow us to interpret a perturbation problem in 
terms of a propagating particle. We think of the solvable part of the 
problem in terms of a particle propagating from point to point. We 
think of the perturbation V as a scattering process that interrupts 
the propagation. To visualize this we’ll write a symbolic expression for 


8 Readers unfamiliar with complex 
analysis can refer to Appendix B. 


^Actually, to ensure causality it’s nec¬ 
essary to replace E n with E n — ie so we 
have 


G+(x,y,E) 


jj m i < t>n(x)<f>n(.y) 

0+ E — E n + ie 

(16.31) 



Fig. 16.1 A cartoon representation of 
the propagator. It will turn out to be 
the basic unit of the Feynman diagram. 

1 Objects like G should be treated with 
caution: they’re not well defined math¬ 
ematical objects, just symbols com¬ 
bined in the shape of the original equa¬ 
tions that we’re using to suggest rela¬ 
tionships. Needless to say these argu¬ 
ments are repeated rigorously in the lit¬ 
erature. 
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the Green’s function in the presence of a perturbation as G = e-h 0 -v 1 
where we’ve used H = Ho + V. In perturbation problems G (with no sub¬ 
script) is called the full propagator. Just considering the solvable part 
of the Hamiltonian Hq we can write Go = E _ 1 Ho , that is, we can find the 
Green’s function for a solvable problem. The ‘0’ subscript means that 
we’re talking about a particle freely propagating, with no scattering and 
so, in general, Go is known as the free propagator. If we now consider 
the perturbed problem and try to expand the full propagator G as a 
There’s a matrix identity that says function of the free propagator Gq we can say 


- =- B- + — B-B— 

A + B A A A A A A 


In the rest of this section we will 
write the functions as G, rather than 
G+ , since the expressions in eqns 16.33 
and 16.34 apply generally to G, G+ and 
G~. 



G Go GqVGo GoVGoVGo 


Fig. 16.2 The scattering processes 
that lead to the full propagator in per¬ 
turbation theory. 



Fig. 16.3 The derivation of Dyson’s 
equation in graphical form. 


G 


1 _ 1 , 1 

E - H n - V “ E - H 0 + E^" 

+ — - —V—1—V—1— + 

E-Hq E-Hn E-H 0 



1 

E— H 0 


(16.32) 


The point here is that, in general, we can’t calculate 1 / (E — H 0 — V). 
However, we can calculate each of the terms on the right-hand side of 
this expression and we might hope that G may be well approximated by 
just the first few of these. 

Switching back from symbolic relations to real functions 11 shows that 
we’ve succeeded in writing the full Green’s function as an expansion in 
free propagators and potentials. The form of the expansion is 


G — Go + G 0 V Go + GqVGqVGq + ... (16.33) 

This is a memorable equation and easily turned into pictures as shown 
in Fig. 16.2. The perturbation problem, previously involving an infinite 
series of unmemorable terms, has been turned into a vivid picture of 
particle scattering. The amplitude for a particle to go from y to x is 
G. This is a superposition of the amplitude for moving freely from y 
to x (which is Go); added to the amplitude for making the trip with a 
single scattering event at some point along the way G 0 VG 0 ; added to the 
amplitude for making the trip with two scatters GoVGoVGo and so on. 
Thus the algebra of Green’s functions is highly amenable to visualization 
of the physical processes. 


Example 16.6 

The perturbation expansion in eqn 16.33 is just a geometric series which we can 
rewrite as 


G = Go (1 + VGo + VGoVGo + ...) 
Go _ 1 

l-VGo G^-V' 


(16.34) 


This is known as Dyson’s equation. This equation is also derived from the algebraic 
manipulation of the cartoons, as shown in Fig. 16.3. This concept will turn out to be 
very useful. Notice that it is non-perturbative: we don’t consider just the first few 
of the terms in a perturbation series, we sum all of them to infinity. 



16.4 The many faces of the propagator 151 


16.4 The many faces of the propagator 

So far we have encountered the free-particle propagator in the space 
and time domain Gq(x, t x , y, t y ) and in the space and energy domain 
Gq{x,ii,E). There are other useful forms that we can derive. 


Example 16.7 

The physical interpretation of G”*” allows us to work out another expression for the 
Green’s function, this time in the momentum space and in the time domain, so we 
write Gq (p,t x , q,t y ). Remember the interpretation: we want the amplitude that a 
particle that starts off in a state |g) will, after a time t x — t y , end up in a state | p). 
Thus 12 


G%(p,t x ,q,ty) = 6{t x -ty)(p\U(t x -ty)\q) 

= 9(t x -t y )(p\q)e - iB «<*•-*»> 

= e(t x -ty)S(p-q)e- iB ^- t y'>. (16.35) 

We see that the free particle cannot change its momentum state, so having both p 
and q is redundant, so we can write the same equation in the following shorthand: 

Go(p,tx,t y ) = 0(t x (16.36) 

Notice that we often miss out the delta functions if we decrease the number of argu¬ 
ments we list in G + . Strictly we should write all arguments and delta functions. 


For dealing with perturbation theory problems, propagators are often 
most useful to know in the momentum and energy domain. 


Example 16.8 

To get to G J (p, E) we’ll take a Fourier transform of Gq~ (p, £, 0) 
Gt(p,E) = 


Remember again, that if we are being strict we should write this as 

G+(p,E,q,E')= J. S(p-q), (16.38) 

H — Hp 16 

telling us the amplitude for a particle with energy E' and momentum q to enter and 
a particle with energy E and momentum p to leave. The delta function conserves 
energy and momentum. 


/ 

/ 


dte iEt G£{p,t,0) 
d te iEt e(t)e- i( ~ E r- ic)t 


_— Ep-\-ie)t 

E — Ep + ie 


E — Ep + ie 


(16.37) 


12 

Note that, for simplicity in these ex¬ 
amples, we’re using the non-relativistic 
normalization so that ( p\q) = 5(p — q). 


We make the replacement E p —> E p — ie 
to ensure the convergence of the inte¬ 
gral at t = oo. It also effectively ensures 
causality is obeyed. See Appendix B 
and Exercises 16.2 and 16.4 for the de¬ 
tails. 
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As a helpful summary, we assemble a table of all of the flavours of Green’s 
function that we’ve introduced so far. 


Gq (X,t x ,y,ty) 

= 9(t x -ty)^<t) n {x)(j)* n {y)e lEn(tx G) 

n 

Go{x,y,E) 

y' i <i>n{x)<j)* n {y) 

^ E — E n + ie 

n 

G+ ( P , t X , ty) 

— 0(t X — ty)e~ lEp ^ tx ~ ty>) 

Gq {p, E) 

i 

E — E p + ie 


Chapter summary 

• Green’s functions are used as propagators which express the am¬ 
plitude that a particle goes from spacetime point y to spacetime 
point x. Various forms of the Green’s function are shown in the 
table above. 

• Dyson’s equation for the full propagator G is the perturbation 

expansion: G = Gq + GqVGq H-= (Gq 1 — V) -1 where Go is the 

free propagator and V describes a scattering process. 


Exercises 


(16.1) (a) Solve the Schrodinger equation to find the wave 
functions <f n (x) for a particle in a one-dimensional 
square well defined by V ( x ) = 0 for 0 < x < a and 
V(x) = oo for x < 0 and x > a. 

(b) Show that the retarded Green’s function for this 
particle is given by 

G+(n,f 2 ,U) = 6»(t 2 -ti)e"(^) (t2_tl) . (16.39) 


(c) Find G + (n, c o) for the particle. 

(16.2) (a) Using a damping factor e 
G+(x,y,E) = j: 


show that 


i <PnG)<l>n(y) 


E — E n -\- ie 

(b) The ^-function may be written as an integral 



dz e~ izt 
2n z + ie 


(16.40) 


Use this to derive Gq (p, E) — E _ E +ie , without 
recourse to a damping factor. 

(16.3) Consider the one-dimensional simple harmonic 
oscillator with a forcing function f(t) = 
described by an equation of motion: 

m^pA(t — u) + muioA(t — u) = F( uj)e~ lu ’ ( ' t ~ u ' > . 

(16.41) 

(a) Show that the solution to this equation is 

F(u) 

A(t -u) = -^ %- T + B(t), (16.42) 

m oj z — lJq 

where B(t) is the solution to the homogeneous equa¬ 
tion of motion. 

(b) Use this result to show that the general form of 
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the Green’s function G(t, u) for a simple harmonic 
oscillator is given by 


G(t,u) 


l_ f du>e^^> 
m J 2n w 2 — u>q 


+ B(t). 


(16.43) 


(c) If G(t, u) is subject to the boundary conditions 
that at t — 0 we have G = 0 and G = 0, show that 


G + (t,u) = -sin u>o (t — u), 

muio 


(16.44) 


for 0 < u < t. Hint: This can be done with a 
Laplace transform of the differential equation for 
the Green’s function, or by using complex analysis 
to do the integral in the previous problem subject to 
the boundary conditions. 

(d) Show that the trajectory of a particle in a sim¬ 
ple harmonic potential subject to a force /(f) = 
Fo sin wot is given by 2m ° 2 (sin ^ 0 1 — u>ot coswot). 

(16.4) (a) By taking the Fourier transform of the equation 


(V 2 +k 2 )G k (x) = S (3 \x), (16.45) 


show that the momentum Green’s function implied 

iS 1 

Gk(q) = lJ^- (16 ' 46) 

Note that Gk(q) is undefined when q 2 = k 2 . This 
reflects the fact that there is an ambiguity in Gk(q) 
in that we don’t know if the wave is incoming, out¬ 
going or a standing wave. This is sorted out with 


an ie factor as shown in the next part. 

(b) Take the Fourier transform of G(f ( x ) = — e 4 l T |^| l 
to show that, for outgoing waves, 


Gt(q) 


1 

k 2 — q 2 + ie 


(16.47) 


Hint: To ensure outgoing waves, you could include 
a damping factor e -6 ^ to damp out waves for 
|x| —> 00 . 

*(c) (For aficionados of complex analysis.) Take an 
inverse Fourier transform of (q) = fc j_h + . g to 
show that it corresponds to the outgoing wave so¬ 
lution G+(x) = 

Hint: On doing the angular part of the integral you 
should obtain: 


g: 




° |g|d|g| 
q 2 — k 2 — ie 


e i I«11 as I_ e - I® I 


(16.48) 

Extend the lower limit of the integration to |q| = 
—00 and identify the positions of the poles in the 
complex \q\ plane. For an outgoing wave, the con¬ 
tour over which the e 1 ^^ part is integrated must 
be completed in the upper half-plane; the contour 
over which the e -1 ^^ part is integrated must be 
completed in the lower half-plane. 

*(d) What do you expect for an incoming wave so¬ 
lution? 
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An error does not become truth by reason or multiplied prop¬ 
agation, nor does truth become error because nobody sees it. 
Mahatma Gandhi (1869-1948) 
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Interacting and 
theories 

non-interacting 

Non-interacting 

Interacting 

Ho 

H = H 0 + H' 

|0> 

l«> 

G 0 (x,y) 

G(x,y) 

No scattering 

Scattering 


^See Chapter 11. 


We describe the Universe by combining fields to form a Lagrangian den¬ 
sity C[(j>{x)\. Our canonical quantization process often allows us to quan¬ 
tize these fields leading to a Universe pictured as a vacuum acted on by 
field operators like dj,. The excitations of the vacuum that the field 
operators produce are particles and antiparticles. 

Our next task is to come up with a scheme to keep track of these 
particles. Having done away with the wave functions we need some 
objects that contain the information about excitations of the system. 
These objects are propagator amplitudes denoted by the letter G. These 
propagators are analogous to the Green’s function amplitudes discussed 
in the previous chapter. The propagators describe the fate of particles 
in a field theory. 

An important distinction that we will make in this chapter is between 
non-interacting and interacting theories. Non-interacting theories are 
those which can be diagonalized using canonical quantization. 1 Canoni¬ 
cal quantization allows us to describe the system in terms of a vacuum |0) 
and non-interacting particles in momentum states | p) created with oper¬ 
ators like dj,. The Hamiltonian for a non-interacting theory is called H$ 
which may be written as Ho = E p o) p a p , and we have Ho\p) = E p \p), 
where E p gives the energies of the non-interacting particles. In contrast, 
interacting theories cannot be diagonalized with canonical quantization. 
Put another way, these theories contain extra terms describing interac¬ 
tions between particles that we cannot transform away with canonical 
quantization. We call the ground states of interacting theories |f 1} and 
the Hamiltonian H. If we act on |fl) with the operator dj, we won’t 
necessarily get a state | p). Instead we might produce a superposition of 
many particles (whose momenta sum to p). Since most interacting field 
theories cannot be exactly solved to give, for example, the energies of the 
excitations or particles of the theory, we need to develop a perturbation 
process to make approximate calculations. This can be done by analogy 
with the perturbation process we suggested in the last chapter, which 
was based around a series written out in terms of propagators. 





17.1 


17.1 The field propagator in outline 


The definition of the field propagator involves a simple thought experi¬ 
ment. We start with our interacting system in its ground state, which we 
denote by fl). The thought experiment works as follows: we introduce 
an extra particle of our choice. We will use this extra particle to probe 
the system. The new particle is introduced (created) at a spacetime 
point (y°,y). It interacts with the system, possibly causing excitations 
in the fields and all manner of complicated things. Then we remove 
(annihilate) the particle at spacetime point ( x° , x) and ask if the system 
has remained in the interacting ground state |fl). We’re interested in the 
amplitude G + {x,y) for the experiment, given by 

Particle annihilated \ f Particle created \ 
at (a; 0 ,®) J ( at (y°,y) ) 

(1"D 

That is, the probability amplitude that the system is still in its ground 
state after we create a particle at y and later annihilate it at x. The 
amplitude for this process will depend on the complicated interaction of 
the probe particle with the system and, it will turn out, will tell us a 
great deal about the system itself. By analogy with the results of the 
previous chapter, the amplitude G + (x,y) is called the Green’s function 
or propagator. 2 So much for definitions. The mathematical object that 
tells this story of the propagator outlined above is 

G + {x,y) = 9(x° - (y)\n) , (17.2) 



G+(x,y) = (n 


where the + sign on the left tells us that the particle is created at y 
before being annihilated at x and the 0-function on the right guarantees 
this. 


Example 17.1 

To see that this function does the job we can break down the action of the opera¬ 
tors on the states into stages. Remembering that a Heisenberg operator has a time 
dependence defined by cf)(t,x ) = e +lHt ftx)e~ lHt , and substitution of this gives an 
expression for the propagator 

G(x,y) = (n|e i ^ 0 ^(£c)e- i "G°- !/0 ^ t (y)e- iij! ' 0 |n>. (17.3) 

Taking this one stage at a time (and temporarily ignoring the possibility of creating 
and annihilating antiparticles), we see that: 

• e — 1 ^ y °\Q) is the state |Q) evolved to a time y°. 

• ft (y)e~ 1 ^ y |f2) is that state with a particle added at time y° at a position y. 

• ~ y0 ^ ft (y)e~ 1 ^ v ° |f2) is that state time evolved to time x °. 

• Now hitting this state with ftx) removes the particle at time x °. 

• We terminate this string on the left with a (£1| to find out how much of the 
original state |f2) is left in the final state. 

The propagator is, therefore, the amplitude that we put a particle in the system at 
position y at time y Q and get it out at position x time x°. 


The field propagator in outline 155 


o 

We call the propagator for an interact¬ 
ing theory G(x,y ) and its ground state 
| Q ,). For a free theory (with no interac¬ 
tions) we call the propagator Go(x,y) 
[or sometimes we will use the equiva¬ 
lent notation A(x, y)] and the ground 
state |0). 
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17.2 The Feynman propagator 

We don’t always want to deal with the particle propagator G + (which is 
constrained with the 9 -function so that x° > y°) when thinking about 
fields. The reason is that although G + accounts for the movement of 
particles it misses out the essential information about antiparticles. 


Example 17.2 


Notation for propagators 

In general 

G(x,y) 

Free propagator 

Go(x, y) 

Free scalar fields 

A(x,y) 

Photon fields 

D(x,y) 

Fermion fields 

G(x, y) or iS(a;, y) 


Gian-Carlo Wick (1909-1992) 


Q 

This form applies only to the bosonic 
(i.e. commuting) scalar fields we’re con¬ 
sidering in this chapter. For an anti¬ 
commuting Fermi field 1 / 1 , we pick up 
a minus sign for every permutation, so 
we have 


Consider, for example, the field creation operator for complex scalar field theory 
(x) = f d p 3 - -— r (d) p e 1 ' p x + S p e —ip ' x ), which creates a particle and annihi- 

(2tt)2 ( 2 E p )2 V / 

lates an antiparticle. If this acts on the free vacuum ^'(y) |0) (no interactions, so 
nothing unpredictable can occur) then we create a particle at y, but the antiparticle 
part annihilates the vacuum 6 P |0) = 0. Similarly, only the particle part contributes 
to (O| , 0(ir), so the propagator (y)\0) describes a particle propagating from 

y to x, but tells us nothing about antiparticles. 


Richard Feynman struggled over how to include the antiparticle contri¬ 
bution for some time before deciding that the most useful form of the 
propagator was one that summed both particle and antiparticle parts. 
This form of the propagator makes up the guts of most quantum field 
calculations and is called the Feynman propagator. How do we get 
the propagator Feynman used which contains both particle and antipar¬ 
ticle parts? We need to introduce a new piece of machinery: the Wick 
time-ordering symbol T. This isn’t an operator, but is much more 
like the normal ordering symbol N in that it is just an instruction on 
what to do to a string in order that it makes sense. The time-ordering 
symbol is defined for scalar fields 3 as 


TMx°)fcy°) 


(j)(x°)(j){y 0 ) x° > y° 
(f>(y°)(j)(x 0 ) x° < y°, 


(17.4) 


Tfi(x°)ip(y 0 ) 

( '0(x o )'0(t/°) x° > y Q 
1 — ifi(y 0 )'ip(x 0 ) x° < y°. 


so that the scalar fields are always arranged earliest on the right, latest 
on the left. The Feynman propagator is then defined as 

G(x,y) = (n\Tcf)(x)(i>\y)\n} 

= 0(x° - y^inifitx)^(y)\n) + 6{y° - x°)(n\4>(y)^fi(x)\^l), 

(17.5) 


where |f T) is the interacting ground state of the system. The propagator 
is therefore made up of two parts. The first part applies for x° later 
than y°: it creates a particle at y and propagates it to x where it is 
destroyed. The second part applies when y° is later than x°: it creates 
an antiparticle at x and propagates the system to point y. Both processes 
are included in the total propagator. 

If, as in the example above, the system doesn’t contain any interac¬ 
tions then particles just move around passing through each other. In 
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this case we call the ground state |0) and we have the free propagator 
sometimes called Go(x,y) or, for scalar fields, more usually just A (x,y) 
where 


A Or, y) = (0\Tj>(x)ft(y)\0). 


(17.6) 


The free-particle propagator is depicted in Fig. 17.1. The free propagator 
forms an essential part of the structure of all perturbation calculations. 
This is because we think of interactions as events that take place at 
particular spacetime points (the Wblobs in Fig. 17.2) and we imagine 
that the particles propagate freely between interactions with the free 
propagator. 

We’ll now derive an expression for the free Feynman propagator 
A (x,y). 


Example 17.3 

Let’s get an expression for the free propagator. If we use the general expression for 
the annihilation field acting on the free vacuum </d(p)|0), we have 

d 3 p 

(2tt) t (2 E p )^ 

Only the particle creation part contributes (since 6 P |0) = 0) and gives 

_ ( d3 P 


A 


Fig. 17.1 The free-particle propagator 
A (x,y) is a line without an interaction 
blob since a free particle doesn’t inter¬ 
act with any other particles on its way 
from y to x. 


p(y) |0>« J XL a + 


(17.7) 


V 


A 


f (i/)|o> = J ■ 


— |P> e ' ! 


(17.8) 


( 2 tt )2 (2E p )2 

How do we get the other half? Take a complex conjugate of eqn 17.8 (swapping 
y —> x and p —> q) to get 


v 


A 


(0|^(x)= [ -^- r(<l\e- iq ' X - 

(17.9) 

A) 

j (27r) 2 (2Eq)2 


V: 


Sandwiching together eqn 17.8 and eqn 17.9 we obtain 

d 3 pd 3 g - iQ . x+ip . v 


(O|0(ar)0 + (j/)|0) = J 

- / 


(2n)3(2E p 2E q )2 


<5 (3) (q - p ) 


d 3 p 


a -ip-(x-y) 


(17.10) 


(2tt)3(2 E p ) 

which corresponds to a particle being created at (y°, y) and propagating to ( x°, x) 
where it is annihilated at the later time. 

We also need to consider the reverse order for the advanced part of the propagator: 

^- Y (oplOje- 1 ^ + S+|0)e i 

(2-it) 2(2E p )2 V 

Here, only the antiparticle creation part contributes (since a p |0) = 0), to give 

d 3 p 


0(*)|o> = j 


■9): 


(17.11) 

(17.12) 

(17.13) 

(27r)3(2F p r ' (17 ' 14) 

This corresponds to an antiparticle being created at point (rr 0 ,*) and propagating 
to (p°, y), where it is destroyed at the later time. Putting the two halves (eqns 17.10 
and 17.14) together we have our final answer for the free propagator A (x,y). 


Fig. 17.2 A third-order term in 
the perturbation expansion of G in 
terms of A (x,y). The amplitude 
for the process is proportional to 
A(x - Z 3 )V(Z 3 )A(Z 3 - Z 2 )V (22) A(Z 2 - 
zi)V(zi)A{z 1 - y ). 


<MA|0> = J 

^lex conjugate 

(0|^(w) = J 


3 T \py p ' x - 

(2tx) 2{2E p )2 

Again we take the complex conjugate (and change x —> y and p - 

d 3 q 


yielding, finally 


<O|0 f (?/)0(a;)|O> = J 


(2n)2(2E q )2 
d 3 p 


-(<?l e 


„i p(x-y) 
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We have the final result that the Feynman propagator is given by 


A (x,y) 


(0\T4>(x)ft(y)\0) 

d3 P uu n 


(2tt) 3 (2 E p ) 


[0(x° - y°) e - i P< x -y'> + 9{y° 


x°)e 


(17.15) 

ip-(x-y)\ 



Feynman 

particle + antiparticle = propagator 

Fig. 17.3 The Feynman propagator is 
made up of a sum of particle and an¬ 
tiparticle parts. 


Notice the way that this came together. We added together a particle 
part from the retarded half of the propagator and an antiparticle from 
the advanced part. This is illustrated in Fig. 17.3. 

We’ve been calling these propagators Green’s functions, but are they 
really Green’s functions in the sense of Chapter 16? The answer is no 
for the interacting case. However, the free-propagator functions are the 
Green’s functions of the equations of motion. So for the scalar field we 
have 

{d 2 + m 2 )A(x — y) = — i<5^ (x — y). (17.16) 

This is examined in Exercise 17.2. In general, propagators are defined 
in such a way that they are true Green’s functions in the absence of in¬ 
teractions. That is, free propagators A(a :,y) are true Green’s functions. 


17.3 Finding the free propagator for scalar 
field theory 

Our expression for the free Feynman propagator for scalar field the¬ 
ory in eqn 17.15 is not very useful in its current form. We would like 
an expression which doesn’t include the rather non-covariant looking 
Heaviside step functions. A simplification is achieved by using complex 
analysis, as shown in the next example. 


'This is given by 
d{x° - y°) = ij 


00 dz e~ iz ^ x °~ y °^ 

-no 27 T Z + \€ 


Example 17.4 

How does the computation unfold? One of the tools we’ll need is the expansion 
of 0(x° — y Q ) in the complex plane. 4 Consider the first term in eqn 17.15, call it 
[A(re, 2 /)](i), and substitute this expression for the 6 function: 

[ A 0 r, 2 /)](i) = 0(x° - 2 /°)(O|<£(x) 0 t (jO|O) 

d 3 p 


= 0(x u - y 




■i: 


- i E p ( x u - y u ) +ip • (* - y ) 

y) 


(: 2tc) 3 (2E p ) 

dzd 3 p e -i(Bp + z)(a: 0 -j; 0 ) + ip(a!- 

l—oo (27r)^(2 Ep) z -\- ie 

Next we make the substitution z' = z + E p , leading to the integral 

dz'd 3 p e -iP(.x°-y°)+ip(a:-y) 


[ A A,J/)](l) = i J 


(17.17) 


(17.18) 


/_oo (27r) 4 (2Ep) z' - Ep + \e 

Then we change the definition of our four-momentum in the integral. We treat z' as 
the new p°, redefining p = ( z'.p ) = (p°,p) leading to 

d 4 p e -ip-(*-v) 


[ A (^-l/)](i) = i J 


-oc (27r) 4 (2Fp) p° - Ep + ie 


(17.19) 
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With the redefinition, p° = E doesn’t equal (p 2 + m 2 ) 2 any more. It’s now just an 
integration variable that we happen to have grouped with the three-momentum to 
clean up the integral. Note that we still have that the energy E p = (p 2 + m 2 ) 2 . 

We can carry the same process out on the second term: 

[ a (z, 2 /)]( 2 ) = 0(y° - x o )(0\4>l)(y)4>(x\0) 

d ' ip p -iBp(!/°-a°)+ip-(l/-») 


= 9(y° - x°) J 

- ■/. 


(2tt) 3 (2 E p ) 
dzd 3 p e - i ( E P+ z )(y°-x°)+ i p(y-&) 


1-00 ( 2 tt ) 4 ( 2 £ p ) 
The substitution is now z' = z + E p , giving 


z + ie 


[A(*, „)](,) = i f 
the substitution p 
[ A (®,2/)](2) = - 


f°° dz'd 3 p e~ iz ' ( y ° ~ x °')+ i P(y- :D ) 

-00 ( 


(17.20) 


(17.21) 


(2tt) 4 (2£'p) p° - E p + ie 
Lastly, make the substitution p —> —p, which is to say (p°, p ) —> (—p°, —p), we get 

dz'd 3 p e -ip°(x°-y°)+ip(a!-y) 


-00 (2tt) 4 (2 E p ) p° + E p - ie 
d 4 p e -ip-(a:-y) 


(17.22) 


l-oo (27r) 4 (2i?p) p° + E p - ie 

The full propagator A (x,y) = [A(x, J/)](i) + [A(x, V)]( 2 ) is, therefore, the sum of 
eqns 17.19 and 17.22: 


A(x,y) = i J 

- / 


d 4 p 


(27r) 4 (2£ p ) 


-ip-(x-y) 


p° - Ep + ie p° + Ep - ie 


d 4 p 

(27T) 4 


-ip(x-y) 


(17.23) 


(p0) 2 -£ 2 +ie' 

Note further that using E 2 = p 2 + m 2 the term in the denominator of the integrand 
is (p 0 ) 2 — Ep + ie = p 2 — m 2 + ie. 


The final result of this example yields the key result that the free Feyn¬ 
man propagator 5 A(x,y) is given by 


AO, y) = (0\T$(x)4>(y)\0) = j 

f d 4 P ip . (x - y) i 

(27t) 4 p 2 — to 2 + ie 

(17.24) 

We can immediately extract A (pi), the Fourier component 6 of the Feyn¬ 
man propagator corresponding to a particle with momentum p: 

A (p) = ~2 

pz 

V ■ - (17-25) 

— m z + ie 


This turns out to be a very useful equation. 


J Because A(.x. y ) is a function of x y, 
it is sometimes written A(x — y). 


5 The connection between the quan¬ 
tities A(x,y) and A(p) is a simple 
Fourier transform: 

H^v) = f ($j e ” iP ' ( *“ y)A(p) - 


17.4 Yukawa’s force-carrying particles 

One of the most interesting thing about particles is that they interact 

with each other. Hidcki Yukawa’s great insight was that this interac- Hideki Yukawa (1907-1981) 
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^In our units, the length a = 1/m. (In 
SI units, a = h/mc.) For a pion (7T°) 
with mass 135 MeV/c 2 , a « 1.5 fm, 
roughly the size of a nucleus. 


(a) 



Fig. 17.4 (a) A virtual particle prop¬ 
agates from y to x mediating a force, 
(b) The same process occurs with a vir¬ 
tual antiparticle propagating from x to 
y. (c) The processes are summed to 
make the Feynman propagator. 


tion process itself involved particles. These force-carrying particles are 
quite different to their more familiar cousins with whom we’re already 
acquainted. They are, however, still described by propagators and pro¬ 
vide an immediate application of our field theory propagators. Yukawa’s 
idea centres around one key notion: 

Particles interact by exchanging virtual, force-carrying particles. 

Virtual particles are defined as particles existing ‘off mass-shell’. The 
mass-shell is the four-dimensional surface obeying the equation p 2 = 
Ep — p 2 = to 2 [see Fig. 11.1]. This, of course, is the usual dispersion for 
a relativistic particle. For an off mass-shell particle we have p 2 ^ to 2 . 
How can this be possible? The argument is that quantum mechanics 
allows us to violate this classical dispersion, as long as we don’t do it for 
too long! By invoking energy-time uncertainty AEAt ~ H, we can say 
that particles of energy E are allowed to exist off the mass-shell as long 
as they last for a short time At < H/E. Virtual particles, therefore, must 
have a finite range since (i) they can’t live forever and (ii) they travel 
at finite velocity. Yukawa guessed that the potential U (r) mediated by 
the virtual particle would have the form 

e -M/<* 

Uir) oc - (17.26) 

where a has the dimensions of length. 


Example 17.5 

i|As| |-r*| 

Looking back, we see that eqn 17.26 bears a resemblance to GjJ”(r) = > which 

is the Green’s function for the operator L = (V 2 + k 2 ). From this, we see that if 
we take i\k\ = —m, then we have that the Yukawa potential U(r) oc — e 47r | r | is the 
Green’s function for the equation 

(V 2 -TO 2 ) U(r) = <5 (3) (r), (17.27) 

which is a time-independent version of the Klein -Gordon equation. What we’ve 
shown here is that Yukawa’s potential is actually the field propagator describing the 
propagation of virtual scalar field particles. The price that’s paid for being off the 
mass shell is that the virtual particles can travel a distance fixed 7 by their mass m. 


The Yukawa particle exchange process is illustrated in Fig. 17.4. We can 
imagine spacetime processes such as that shown in Fig. 17.4(a) where 
at time y° particle A emits a virtual particle Q, with mass toq from 
position y that at a later time x° collides with particle B at position x. 
The energetics of this process may be written 

E a = E' a + E q , (17.28) 

that is, particle A loses energy Eq = (pg + TOq) 1 / 2 . Of course in this 
process particle B gains energy Eq. 
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We may be in serious trouble if Yukawa exchange involves a net trans¬ 
fer of energy from particle A. It makes the interaction rather one-sided 
for one thing. To maintain the symmetry of the interaction, we must 
also imagine the additional process shown in Fig. 17.4(b) where particle 
B (at position x) emits an identical particle <5 at time x°, which at later 
time y° collides with particle B at y. Both processes are necessary in 
order to conserve energy so we must have an identical virtual particle 
heading back to A, which gives back the same amount of energy. 

What does the Feynman propagator have to say about Yukawa’s no¬ 
tion of force-carrying virtual particles? Actually, the exchange of vir¬ 
tual particles idea is exactly what the Feynman propagator is describing. 
Recall that the Feynman propagator, describing a particle propagating 
from spacetime point x to spacetime point y, looks like 

A (x,y) = 6(x°-y°)(0\ci)(x)ft (y)\0) +0(y°-x°)(0\4>(y)* 4>(x)\0). (17.29) 


The first term describes a particle travelling from y to x for x° > y °, 
while the second describes an antiparticle travelling from x to y for 
y° > x°. These situations are what we showed in Fig. 17.4. and we see 
that both processes are included in the Feynman propagator. If we now 
turn to the Feynman propagator in momentum space we have 


A ip) = 


p2 _ m 2 _|_ (p 0 ) 2 — p 2 — m? + : 


(17.30) 


Notice that the denominator is always off mass shell, that is to say that 
the expression is only well behaved if p° ^ \J P 2 + fn 2 . The message is 
that the Green’s function describes the propagation of virtual particles. 

When we come to calculating Feynman diagrams in a few chapters’ 
time, we’ll see that the amplitude A for two particles to scatter off each 
other as shown in Fig. 17.5 contains a term oc i/ (p 2 — rn 2 + ie). We can 
check this immediately. If we want to calculate the scattering amplitude 
for two particles in non-relativistic quantum mechanics we can use the 
Born approximation, which tells us that the amplitude is proportional 
to the Fourier transform of the scattering potential: 

A oc U(p) = J d 3 r U(r)e~ ip r , (17.31) 

where p is a scattering vector. If we use Yukawa’s suggestion for the 
interaction potential U{r) oc — e 47r | r | corresponding to an interaction 
with a range of a, then doing the integral 8 yields 



Fig. 17.5 A Feynman diagram for 
two particles scattering. The ampli¬ 
tude for the process includes a factor 
i /{jp 2 — ra 2 + ic), which describes the 
propagation of a virtual, force-carrying 
particle which can be thought of as me¬ 
diating a Yukawa force. 


8 The integral can be evaluated using 


A oc 


1 

— \p\ 2 — m 2 ’ 


(17.32) 


which looks rather like the propagator A (p) with p° = 0. Our result for 
the propagator A (p) is the four-dimensional, relativistic generalization 
of this result. Indeed, as we saw previously, the propagator _■ , 2_ rrt 2 is 
the Green’s function for the time-independent Klein-Gordon equation 
(—V 2 + m 2 )(j)(x) = 0, while A (p) is the Green’s function for the four¬ 
dimensional Klein-Gordon equation ( d 2 + m 2 )(j>(x) = 0. 
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^ Remember that for a real scalar field 
cp(x ) = <jy (a:) and so we may write the 
scalar field propagator without the dag¬ 
gers: A (x,y) = (Q\Tcf)(x)<})(y)\Q). For 
simplicity’s sake, we will do this later 
in the book. 


1 Another way of making sense of the 
ie is that it is meant to ensure that 
the integral never hits this pole. It is 
an instruction to avoid it by adding an 
infinitesimal imaginary part to the de¬ 
nominator. This is discussed in Ap¬ 
pendix B. 


(a) (b) (c) 



(i) (2) (i) 


Fig. 17.6 (a) The two-point propaga¬ 
tor G^ 2 \ The blob represents the in¬ 
teractions that the particle undergoes 
with other particles in the system, (b) 
A four-point propagator . (c) An 
n-point propagator G^ n \ 

See Peskin and Schroeder for a dis¬ 
cussion of the LSZ reduction formula. 


17.5 Anatomy of the propagator 

Let’s take a closer look at the anatomy of the free propagator 9 A(x, y) = 
(0\T(f>(x)ffi (y)|0) for scalar field theory. It arose from the amplitude that 
a single field excitation is created at spacetime point y and is found at 
spacetime point x. It contains the sum of amplitudes corresponding to 
the possibility that this excitation might be a particle or an antiparticle. 
In position space we have that 

A(x,v) = (0\T$(x)fi(y)\0) = [ -V—■ (17-33) 

J (27r) 4 p z — m z + le 

It is important to notice that we have a singularity (or pole 10 ) in the 

function when the particle is on mass shell, that is, whenp 2 = E 2 —p 2 = 

o ^ 

m . 

Now let’s return to the full propagator for a general interacting the¬ 
ory G(x,y) = {Cl\^>(x)^(y)\Cl). (Remember that we call the interacting 
ground state |fl).) The full name for G{x,y) is the ‘single-particle prop¬ 
agator’ or the ‘two-point Green’s function’. We can draw a diagram 
illustrating G(x,y) as shown in Fig. 17.6(a). The two stumps corre¬ 
spond to the incoming or outgoing particles. The hatched blob in the 
middle depicts interactions: the adventures (involving other particles) 
that our test particle might have between its creation and annihilation. 
The name ‘two-point’ comes from the fact that a single particle is cre¬ 
ated and a single particle is annihilated, that is, the sum of particles 
entering or leaving the system, i.e. (number in)+(number out) equals 2. 
We often emphasise this by writing the propagator as G^ 2 \x,y) with 
a superscript ‘(2)’. We can go further and define the n-point propaga¬ 
tor, where n equals the sum of incoming and outgoing lines. These will 
be denoted G^ n \ For example we might want to study the four-point 
propagator [Fig. 17.6(b)] 

G^\x 1 ,x 2 ,x 3 ,X4) = (O|T^(xi)^(x2)0 t (x 3 )^ t (x4)|n), (17.34) 

which is useful when we’re considering two particles coming in and two 
particles leaving as might occur in a two-particle scattering experiment. 

The aim of much quantum field theory is to find the full interacting 
Green’s functions G. One reason is that G can be used to calculate 
scattering amplitudes using an equation known as the LSZ reduction 
formula. 11 Another reason is that the form of the interacting Green’s 
function tells us a lot about the system. For example, the two-point 
propagator, G^ 2 \x,y) = (O|^(x) 0 'l'( 2 /)|n), that tells us about the fate of 
single particles, is especially useful. 

To see this we can consider what happens if we start from a free 
scalar theory and slowly turn on the interactions to form an interacting 
theory. Before we turn on the interactions we have the free propagator 
A (p) =$s p2 _,^ +ie - The mass of the particle is given by the position of 
the pole. The residue at the pole (which is i) tells us that the operator 
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ft (y) creates precisely one quantum from the vacuum |0) (and later we 

destroy it). When we turn on the interactions it turns out 12 that the full 12 The details of the full propagator are 
propagator resembles the free version. The general form of the two-point discussed m Chapter 33. 
propagator in momentum space is 


G (2) (p) 


_i ftp _ 

p 2 — m 2 — S(p) + iT p 


where 


Multiparticle 

parts 


(17.35) 


• Z p tells us how likely it is for a particle with momentum p to exist 
stably without being destroyed by the interactions, 

• £(p) tells us about how the particle of momentum p interacts with 
the vacuum, 

• (2r p ) _1 is the particle’s lifetime, 

• The full propagator also contains a contribution from short-lived 
multiparticle states which can be scattered out of the vacuum. 


Chapter summary 


• The Feynman propagator for single-particle excitations in the field 
is given by G(x,y) = (n\Tftx)ft{y)\n). 

• For the scalar field theory the momentum space propagator for 
single particles is A (p) = 


Exercises 


(17.1) Show that the retarded field propagator for a free 
particle in momentum space and the time domain: 
Go(P,tx,q,t v ) = 0(t x - t y )(0\ap(t x )al(ty)\0) is 
given by 8(t x — t y )e~ l( - Eptx ~ E ' lty ^(p — q). 

(17.2) Demonstrate that the free scalar propagator is the 
Green’s function of the Klein-Gordon equation. 
That is, in (1 + 1) dimensions, show 


d 2 


d(x 0 ) 2 
= — ft 2 \x — y). 


~ +™ 2 ) {0\Tftx°,x)ft(y°,y)\0) 


By considering the Fourier transform of the field, 
show that the action may be written 


S = 


\j (^4^ _ P)(P 2 -fn 2 )4>{p)- (17.38) 


(17.3) Consider a scalar field theory defined by the La- 
grangian 


This provides us with an alternative method for 
identifying the free propagator Go(p) as (i/2 times) 
the inverse of the quadratic term in the momentum- 
space action. 

(I? 36 ) (17.4) Show that the Feynman propagator for the quan¬ 
tum simple harmonic oscillator with spring constant 


mu >o is given by 


£= ftd^(x )} 2 -—Mx )] 2 


(17.37) 


GM = - 


1 

m ) u> 2 — Wq + ie ’ 


(17.39) 
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(17.5) (a) Consider a one-dimensional system with La- 
grangian 


1 / dfi(x) 

2 \ dx 


2 

+ 



(17.40) 


The choice of sign makes this a Euclidean the¬ 
ory (described in Chapter 25). Descretize this 
theory (that is, put it on a lattice) by defining 
4>j = j=E p ^p elB ”i where j labels the lattice 
site, a is the lattice spacing and N is the number of 
lattice points. Using the method in Exercise 17.3 
show that the action may be written 

_ 1 7 / 2 2 2 \ 7 
cospa + m ) 

(17.41) 


and read off the propagator for this theory. 

(b) The Lagrangian for a one-dimensional elas¬ 
tic string in (l-t-l)-dimensional Minkowski space is 
written 

C = \ [(dofif - {d^) 2 } . (17.42) 

Descretize the theory by defining = 

^ E p / t^p(w)e _iu,t e lwo . Show that the prop¬ 
agator for the (phonon) excitations that this theory 
describes is given by 

G(w,p) = -3-^-T, (17.43) 

cj 2 — Uq(1 — cos pa) 

where u>q = 2/a 2 . 



The S'-matrix 



The Matrix is a system, Neo. That system is our enemy. 
But when you’re inside, you look around, what do you see? 
Businessmen, teachers, lawyers, carpenters. The very minds 
of the people we are trying to save. But until we do, these 
people are still a part of that system, and that makes them 
our enemy. 

Morpheus (Laurence Fishburne) in The Matrix 


Canonical quantization works well for a small number of theories, where 
it results in a diagonalized Hamiltonian describing the excitations of the 
quantum held in terms of non-interacting, or ‘free’ particles. In fact, the 
theories that can be solved exactly using the methods of quantum held 
theory generally describe free particles with no prospect for interactions. 
A world of free particles would be a very boring place since particle 
interactions are at the heart of the workings of Nature. Most interesting 
theories involve particle interactions and cannot be solved exactly. We 
therefore need to resort to some sort of approximate method, like the 
perturbation theory of quantum particle mechanics. 

We will explore a perturbation theory for quantum helds in the follow¬ 
ing chapters. The Lagrangian describing the held is often the addition 
of two parts: a solvable part (frequently describing non-interacting par¬ 
ticles only and therefore called the free part) and a part that makes 
the problem unsolvable (frequently describing interactions and called 
the interacting part). As before, the free part is able to be solved via 
canonical quantization and results in non-interacting particles. The in¬ 
teraction part is more interesting because of one key fact: 

Interactions involve the creation or destruction of particles. 

One example of an interaction process is particle scattering. Particles are 
hred at each other. At the start of the experiment they are far from each 
other so don’t interact: they are free. When they’re smashed together 
they interact but only for a short time. In this chapter we’ll describe 
a method of dealing with the interactions in a scattering process. This 
will be time well spent. It turns out that the machinery developed here 
will be central to creating a more general perturbation theory that can 
deal with other interactions. 


18.1 The S'-matrix: a hero for our 
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18.1 The S'-matrix: a hero for our times 


John Wheeler (1911-2008) made nu¬ 
merous contributions to theoretical 
physics and is also famous for coining 
the terms ‘black hole’ and ‘wormhole’ 
and ‘it from bit’. 





(b) 


(c) 


'4 s‘~ 

M 




Fig. 18.1 A prototypical scattering ex¬ 
periment. (a) The particles begin well 
separated, (b) They interact in a com¬ 
plicated way. (c) They recede from each 
other and end up well separated. 


One of the basic building blocks of quantum field theory is the scattering 
or S-matrix. The idea was one of John Wheeler’s and it’s a great 
one. The S'-matrix is rather like the time-evolution operator, but for an 
important special case: it describes a Scattering experiment. 

In a prototypical scattering experiment we start off with well separated 
particles [Fig. 18.1(a)]. We fire the particles at each other and they 
interact in some complicated way governed by the Hamiltonian of the 
real world H [Fig. 18.1(b)]. The particles then recede from each other 
and end up well separated [Fig. 18.1(c)]. The first thing to say about this 
is that the complicated interactions of the real world make this process 
impossible to analyse. Working in the Heisenberg picture, we can’t even 
write down convincing expressions for the time-dependent operators. All 
appears to be lost! What do we do? 

Wheeler says that we start by imagining a world without compli¬ 
cated interactions. That is, we split H into two parts H = H 0 + H ', 
where Hq describes a simple world of non-interacting particles described 
by some set of state vectors. Remember that we’re working in the 
Heisenberg picture here: these state vectors don’t change at all. For 
two particles one simple-world state might be a momentum state like 
IV’} = \P 2 P\ )simpieworid• Now we look at the real world and ask ‘Is there 
a real-world state that looks like the simple-world state \p 2 P 1 ) simpieworid?’ 
There is such a state at the start of the scattering experiment (at a time 
t —> — 00 ) when the particles are very far apart. We’ll call this real-world 
state |p 2 Pi)]e a i wor i ( j or an ‘in’ state, for short. Similarly, we pick out an¬ 
other simple-world state, e.g. \<j>) = | <7291 Simpieworid and ask if there’s a 
real-world state that looks like this one. There is, but this time at the 
end of the experiment (t —» 00 ) when the particles are well separated 
after their interaction. We’ll call this one | q 2 <Zi)°eaiworld- Notice that the 
simple-world states only describe the real world in the limits t —> ± 00 . 

For the real-world scattering process in which we’re interested the 
amplitude A for starting with |p2Pi)re a iworid anc ^ ending up with 

I ^2 ^1) realworld is S iven b y 


realworld (^1^2 |P2Pl) real world • ( 18 . 1 ) 

We must recreate this amplitude using the simple-world states which, 
after all, are the only ones we can work with. How can this be achieved? 
Wheeler’s answer is the S'-matrix. We define 

■A = realworld (9192 \P2Pl) realworld = simpleworld (9l92 | S\p2Pl) simpieworid ■ 

( 18 ; 2 ) 

So the S-matrix (that is, the matrix elements of the S-operator) contains 
the amplitudes for starting with a particular ‘in’ state and ending up 
with a particular ‘out’ state. 

Let’s take this further and calculate a real amplitude. We now need 
two things: 
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• A way of getting a suitable Hq to describe some useful simple-world 
states which resemble the ‘in’ and ‘out’ states. 

• A way of calculating an expression for the ^-operator. We can 
then use the eigenstates of the simple Hamiltonian to work out an 
amplitude. 

In the next section we’ll examine the machinery needed to get Hq, which 
involves yet another new formulation of quantum mechanics. This one 
is called the interaction representation. 


18.2 Some new machinery: the interaction 
representation 

So far in our quantum mechanics we’ve either had a formulation with 
time-dependent states and time-independent operators (the Schrodinger 
picture) or one with time-independent states and time-dependent oper¬ 
ators (the Heisenberg picture). A third way is available, called the in¬ 
teraction representation, where both the states and the operators have 
some time dependence. It turns out that this combination is the one we 
need to obtain Hq, the simple-world Hamiltonian. 

As stated above, to get Hq we split up the Hamiltonian into two parts 
H = Hq + H'. We call these a free part Hq and an interaction part H'. 
The free part will generally be one that is time-independent and can be 
easily solved. We then say that operators in the interaction picture 0\ 
evolve in time via the free part Hq of the Hamiltonian 

Oi{t) = e lkot de- iflot . (18.3) 

This is just like the Heisenberg version of quantum mechanics, except 
that we’re only using the free part of the Hamiltonian. We therefore 
have a Heisenberg-like equation of motion for the operators 

i^ = [d 1 (t),Ho}, (18.4) 

but again we emphasize that it just involves Hq. 

So far we’ve not included H' . Its inclusion results in the wave function 
having some time dependence, which it wouldn’t in the non-interacting 
case. To see what happens, we compare a matrix element from the 
Schrodinger picture to one in the interaction picture 

{<!>{t)\d\ip(t)) = (<f>i (t) | e lHot Oe~ lHot | V’i (t)}, (18.5) 

where the interaction picture states are labelled with a subscript I. We 
can see that, for the matrix elements to be the same as in the Schrodinger 
picture, we’ll need to define 


| Mt))=e iHot \m)- 


(18.6) 
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A 



Fig. 18.2 Turning on an interaction 
Hamiltonian H '. We multiply it by A (t) 
which has this profile. 


We can now get to the equation of motion for the interaction picture 
wave function by differentiating eqn (18.6) with respect to time. We find 

i^l Mt)) = j Aot (-H 0 + i^\m) 

= (-Ho + H) | m), (18-7) 

and recalling that H = Ho + H' and eqn 18.6 we have 

i^l Mt)) = (18-8) 

which we can rewrite in a recognizable form as 

i^lVhW) = ^iWIV’iW). (18.9) 

where H\{t) = e lHot H'e~ lHot . This then completely defines the interac¬ 
tion picture. 

To recap the description of the interaction picture: 

• Both the operators and states can evolve in time. 

• The operators evolve by the free part of the Hamiltonian: (j>\(t) = 
e 1 ^ 0 * <j) e~'^ ot . 

• The states evolve according to the interaction part of the Hamil¬ 
tonian: ijj|^>(t))i = where Hi(t) = e'^ ot H’ e _lJ ^ ot . 

Note that all of our different representations coincide at t = 0. 

18.3 The interaction picture applied to 
scattering 

Why is the interaction picture useful for the scattering problem? The 
point is that the interaction part of the Hamiltonian is zero at the start 
and end of the problem. In fact, we imagine that the interaction part of 
the Hamiltonian H 1 is turned on and off slowly and smoothly as shown 
in Fig. 18.2. When Hi = 0 we just have the Heisenberg picture for the 
free part of the Hamiltonian Hq. This is good because we choose Ho so 
that we can solve it using our canonical quantization machine and we 
certainly then know its eigenstates. 

What becomes of the states? We can identify the simple-world states 
as those of the interaction picture at the start and end: 

l^simpleworld ~ l < ( , i( = t° 0 )). (18.10) 

They are eigenstates of Ho, which is vital since it allows us to build 
them up from the state |0) which is the vacuum of Ho (called variously 
the non-interacting, free or bare vacuum) using our beloved creation 
and annihilation operators. During the interaction process we have that 
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Hi is nonzero so the states evolve in some mysterious and complicated 
way. However, at the end of the experiment Hi is zero again and the 
states freeze. It’s a bit like musical chairs in this respect. The states are 
initially sitting down, they rearrange themselves during the music and 
then freeze on the chairs when the music stops. 

We’ve thought about the states, but what about the operators? Sim¬ 
ple: the operators time-evolve according to Hq at all times. We can 
therefore use the freely evolving field operators like that we’ve used 
previously. These also enjoy an expansion in terms of creation and an¬ 
nihilation operators. 

Finally we need to figure out how to work out the S-operator. We are 
assisted by the useful fact that all of our quantum mechanical pictures 
are defined so that they coincide at t = 0, so that we’re free to write 

simpleworld I^11/’) simpleworld realworld ( < AIV ) ) realworld (<Mo)|V , I (o))- 

( 18 -n) 

Then, if we knew the time-evolution operator in the interaction picture 
Ui(t2,ti), we could say that 

simpleworld ( d I ^ \ d) simpleworld — (^i(oo)|t/i(oo,0)t/i(0, oo)|'i/ J i( oo )) 

= (0i(oo)|17i(oo,-oo)|i/’i(-oo)) (18.12) 

= simpleworld \0K "[ ( OC . 00 ) | V/i)simpleworld ■ 

We get the important result that the S'-operator is the time-evolution 
operator for the interaction-picture Ui(t, —t) as t —» oo. 


18.4 Perturbation expansion of the 
^-matrix 


As in the Schrodinger picture, we have an equation of motion for the 
interaction picture time-evolution operator 

ix— Ui(t2, t\) = Hi(t2)Ui(t2,t±), (18.13) 

dt2 


where U(t,t) = 1. We might be tempted to treat Hi(t) as a number, 
and if that were valid we could then integrate and write U(t 2 ,t\) = 
tldg is wrong. The interaction Hamiltonian Hi(t) 
doesn’t necessarily commute with itself when evaluated at different 
times. That is, in general, (H^b), Hi(ti)] ^ 0. To circumvent this 
problem, we’ll define a new version of the exponential which will make 
this equation right. The solution turns out to use the time-ordered 
product 

T[A 1 (t 1 )A 2 (t2)...A n (t n )l (18.14) 

defined as the string arranged so that the later operators are on the left. 1 
As discussed in the previous chapter, the Wick time ordering symbol 
symbol T[ ] is a little like the normal ordering procedure N[} in that it is 


bust remember ‘later on the left’. 
Note that the anticommuting nature of 
Fermi fields causes us to pick up an ad¬ 
ditional factor (—lA’ on time ordering, 
where P is the number of permutations 
required to rearrange the operators. 
From here until Chapter 35 we will fo¬ 
cus our discussion on (bosonic) scalar 
fields. Fermi fields, whose behaviour is 
based on the same general principles we 
will examine, are discussed specifically 
in Part IX. 
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Freeman Dyson (1923- ) 


not an operator, it’s an instruction on what to do to a string of operators. 
The reason that using T[ ] helps us here is that everything within a time- 
ordered product commutes, so we may now take a derivative. We can 
therefore write an expression for U\{t 2 ,t\) that incorporates correctly 
time-ordered operators as follows: 


Ui{t2,h)=T 


e * St i dtHi(t) 


(18.15) 


This expression is known as Dyson’s expansion. 


Example 18.1 

We’ll pause to justify eqn 18.15. Using the fact that everything within a time-ordered 
product commutes we can treat Ui(t 2 ,ti) as a function of t 2 and take a derivative 
with respect to this time to obtain 


dt2 




= T 


i7i(t2)e- i/ *i 2dt " i ( ‘)' 


(18.16) 


Next we notice that t% is the latest time in the problem (since U evolves the system 
from ti to t' 2 ) and the time ordering therefore puts the operator //[ ( 1 - 2 ) left-most. 
We can then pull H\(t 2 ) out of the time-ordered product to obtain 


d*2 




= Mt2)T 


5 -i/ tl 2 dt Hi(t) 


(18.17) 


Comparing this with eqn 18.13 we see that the time-evolution operator in the inter¬ 
action picture is given by 


U 1 (t 2 ,t 1 ) = T 


-i/t? dtH r (t) 


(18.18) 


We also know that the S-operator is the limit S = Uifa —> oo,ti —> 
—oo). Using this, we are left with Dyson’s expansion of the S'-operator 


S = T 


d 4 x TCi(x) 


(18.19) 


where we’ve replaced / dtHi(t) with / d 4 xHi(x), an integral of the 
Hamiltonian density. 

The exponential form of Dyson’s expansion in eqn 18.19 is beautifully 
memorable, but not very useful. Usually the integral in the exponent 
cannot be done exactly so we have to expand out the exponential in 
Dyson’s expansion thus: 


S = T 



d 4 zHi(z) + 4^ j d 4 y d A wTLi{y)'H\{w) + ... 


(18.20) 

Provided that the interaction Hamiltonian Hi{x) is small compared to 
the full Hamiltonian density, then this provides the basis for a perturba¬ 
tion expansion of the S-matrix. We’ll deal with this expansion in detail 
in the next chapter. Notice that in eqn 18.20 we integrate over different 
spacetime coordinates each time the interaction Hamiltonian appears. 
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18.5 Wick’s theorem 


To make progress in digesting long strings of operators that appear in 
Dyson expansions, we are faced with solving the following conundrum: 

• We will frequently have to evaluate a term like 

(0\T[ABC ...Z}\0), (18.21) 

a vacuum expectation value (or VEV for short) of a time- 
ordered string of operators. This is hard. 

• On the other hand, what is very easy to evaluate is 

(0\N[AbC ... Z]|0), (18.22) 

a VEV of a normal-ordered string of operators. This is trivial 
because normal ordering places annihilation operators on the right 
and creation operators on the left, and so this VEV is identically 
zero. So if we could find a way of relating iV [stuff] to T[stuff] then 
the problem would be solved. 

The next thing to notice is that the mode expansion of a field opera¬ 
tor <f> contains two parts, an annihilation part and a creation part (see 
eqn 11.8) and so can be written (/> = (/>“ +(f> + . Thus we have 2 </>”|0) = 0 
and (0|(/> + = 0. Let’s now take the simplest non-trivial case of a string 
of operators and consider two field operators A and B. Their product 
is given by 

AB = (i+ + A~)(B+ + B~) = A+B+ + A~ B~ + A+B~ + A~ B+. 

(18.23) 

If we normal-order this product, we get 

N[Ab} = A+B+ + A~B~ + A+B~ + B+A~, (18.24) 


where we have only needed to swap operators in the final term. Thus 
IV [AB] only differs from Ab by 

AB - N[Ab} = A~B+ - B+A~ = [A~,B +], (18.25) 


i.e. by a simple commutator of operators (which we know from previous 
experience 3 will just be a complex number, or c-number for short). Now 
the fields A and B are functions of spacetime coordinates, so we could 
also write down the time-ordered product 


T[A(x)B(y)} 


A(x)B(y) x° > y° 
B(y)A(x) x° < y°, 


(18.26) 


and therefore 

T[A(x)B(y)\ - N[A(x)B(y)} 


[A ( x),B+(y )] x° > y° 
[B-(y),A + (x)\ x° < y°. 


(18.27) 


2 

Our definition assigns the superscript 
+ to the creation part and — to the 
annihilation part. Some books, e.g. Pe- 
skin and Schroeder, choose an opposite 
convention. 


^ Commutators turn out to be quanti¬ 
ties such as i h. 
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4 The proof is by induction, and can 
be found on page 90 of Peskin and 
Schroeder. 


It’s worth noting that Wick’s theorem 
applies for free fields only, such as those 
in the interaction picture. This will be 
important in Chapter 22. 


Since the VEV of a normal ordered product is zero, we can immediately 
write down 


<0|T[i(*)B(y)]|0) 


(0|[i-(*),B+(i/)]|0) x° > y° 
<0|[a-(y),i+(*)]|0> x°<y°. 


(18.28) 


If we choose A = B = </>, then the quantity on the left is simply a 
Feynman propagator. 

Subtracting out a normal-ordered string of operators from the time- 
ordered version is clearly a useful thing to do and so we formalize this 
and give it a special name: a contraction, defined as 


i—i 

Ab = T[AB] - N[AB]. (18.29) 


i—i 

The contraction AB is simply a commutator (see eqn 18.27) because 
the only difference between normal ordering and time ordering is that 
we have shuffled various creation and annihilation operators, thereby 
accumulating various c-numbers. Thus the contraction is a c-number, 
and it takes the value 

i— ii— i i—i 

AB = AB(0|0) = (0|AB|0> = (0|T[AB]|0). (18.30) 

Moreover, since it is only a c-number, normal ordering has no effect on 
it and we can write 

i—i i—i 

t[Ab] = n[Ab] + Ab = n[Ab + Ab]. (I 8 . 31 ) 

This result can be generalized 4 to longer strings of operators to yield 
Wick’s theorem, which can be stated as follows: 

all possible contractions of 
ABC ...Z 

(18.32) 


T[ABC ... Z] = N 


ABC ...Z + 


Example 18.2 

Wick’s theorem can be illustrated for the case of four operators 


T[ABCD] = N [AbCd] + N 

i i 

ABCD 

+ N 

ABCD 

+ N 

ABCD 

f . H .1 

r. r- 1 

ll 

r. . r 

ll 



+N 


+N 


ABCD 


ABCD 


■ N 


■ N 


ABCD 

'H=TT 

ABCD 


■N 


-N 


ABCD 


ABCD 


(18.33) 


This is a sum of normal ordered terms including zero, one or two contractions. 
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Let us look at a particular term involving a single contraction in a Wick 

expansion of T[ABC ... Z\, say for example N[AbCDEFG ...]. Since 
the contracted part is a c-number, we can factor it out of the normal 
ordered product 5 and write 

i-1 i—i 

N[ABCDEFG ...] = CF x N[ABDEG...}. (18.34) 

However, when we evaluate the VEV of this term it will vanish because 
(01 N [anything] 10) = 0. Therefore, the only terms which survive when we 
work out the VEV of T[ABC... Z] are the ones involving contractions 
of all operators. 6 Thus 

i— ii—ii— i i—i 

(0\T[ABC... YZ]\Q) = (0\T[ABCDEF ...YZ}\0) (18.35) 

I r~]—i i—i i—I 

+(0| t[Abcdef ...YZ] |0) 

( all other combinations involving \ 
contractions of every pair of operators J ’ 


c 

There could be a sign change in 
this expression if the operators are 
fermionic, so in that case you have to 
pay attention to any sign changes ac¬ 
cumulated with swapping the order of 
operators. 


^By extension, the VEV of an odd- 
numbered string of operators will be 
zero, since you can only contract pairs 
of operators and so there will always be 
one operator left over. 


and using eqn 18.30 this reduces to 
(0\T[ABC...YZ]\0) 

= <0|T[iH]|0)<0|T[(7n]|0)<0|T[ J EF]|0)...(0|T[FZ]|0) 

+ {0\T[Ac]\0}(0\T[I3D}\0}(0\T[EF}\0) ...(0\T[YZ}\0) 
+ ... 


that is, the VEV of a time-ordered string of operators is given by the 
sum of products of all possible combinations of VEVs of time ordered 
pairs. 


Example 18.3 

For the case of four bosonic operators. 


(o|t UbodI | 0 > 



r m m ■ 


1 —Fi 1" 


1 1-1 1 

<0|T 

ABCD 

|0> + <0|T 

ABCD 

|0) + (0|T 

ABCD 


(0\T[Ab]\0)(0\T[CD]\0) + {0\T[Ad]\0)(0\T[BD]\0) 
+(0|T[AD] |0) (0| T[BC] |0>. (18.37) 


In particular 

(0\^(x\)<j>{x2)rj>(x3)4>{x4)\t)) = A(xi - X2)A(X3 - X4) + A(xi - X3)A.(X2 - X4) 

+A(iri — X4)A(x2 — X3), (18.38) 

where we have used the Feynman propagator A(:ri — X 2 ) = (OfFb(.ri ) 0 (x 2 )|O). 


Wick’s theorem is an essential tool for expanding S in perturbation 
theory which we discuss in the next chapter. 
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Chapter summary 

• Real-world scattering processes can be described by an S'-matrix 

element via realworld^lV’K-ealworld = simpleworld ( < / > |'S|' ! / , )simpleworld- 

• The S'-matrix works in the interaction picture where operators 
time-evolve according to the free part of the Hamiltonian Hq and 
states time-evolve according to the interaction part Hj(t). 

• The /S'-operator is given by the Dyson equation S = 

T[e~ i/d 4 ^ 1 ^)]. 

• Wick’s theorem allows us to grind down long time-ordered strings 
of operators. A vacuum expectation value (VEV) of a time-ordered 
string is a sum of products of VEVs of time-ordered pairs. 


Exercises 

(18.1) Although the interaction picture is most useful to 
us as a step towards a powerful version of pertur¬ 
bation theory, it is also useful in itself for some 
problems, such as this famous example from mag¬ 
netic resonance. Consider a Hamiltonian describing 
a spin- 1/2 particle in a large, static magnetic field 
Bo which is subject to a perturbing, perpendicular, 
oscillating held B\: 

H = jBoSz + yBi(S x cosySof + S y sinyBot). 

(18.39) 

Here 7 is the particle’s gyromagnetic ratio. Notice 
that the frequency 7 Bo of the oscillating held with 


amplitude B\ is chosen 

so that it matches the en- 

ergy level separation caused by the static field Bo. 
(a) Write the problem in the interaction represen¬ 
tation by splitting the Hamiltonian into free and 

interacting parts. 

(b) Use the identities 


S± = 

S x ± i S y , 

S+e iult = 

it oS z t A —it oS z t 
e o+e , 

S-e~ iult = 

e iuS z tg e ~iu>S z t^ (18.40) 


to simplify the interaction Hamiltonian H\(t). You 
should find that you’re able to remove the time de¬ 


pendence completely. 

(c) Find the interaction picture evolution operator 

Ulfa, ti). 

(d) What is the probability that a particle initially 
prepared in a state | |) at t = 0 will still be in that 
state at time t. What is the probability that it will 
be found in the | |) state? 

(e) Find the expectation value of the S z operator. 

(18.2) Show that \ip(t = oo))i = X)^(/ ) |>5|'!/)|/>), where the 
states on the right-hand side are the simple-world 
states defined in the chapter. 

(18.3) Use Wick’s theorem to express the string of Bose 
operators a p a\ak in terms of normal ordered fields 
and contractions. 

(18.4) (a) Normal order the string of Bose operators 
bgbtftf using the usual Bose commutation relations, 
(b) Show that Wick’s theorem gives the same an¬ 
swer. 

(18.5) Use Wick’s theorem to simplify 

(0|Cpi_<jCp2+gCp 2 Cpi 1 0), 

where the operators c p and c p create and annihilate 
fermions respectively. 
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They were funny-looking pictures. And I did think con¬ 
sciously: Wouldn’t it be funny if this turns out to be useful 
and the Physical Review would be all full of these funny 
looking pictures. It would be very amusing. 

Richard Feynman (1919-1988) 


Like the silicon chips of more recent years, the Feynman 
diagram was bringing computation to the masses. 

Julian Schwinger (1918-1994) 
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One of Richard Feynman’s greatest achievements was his invention of 
(what are now called) Feynman diagrams. These are cartoons that 
represent terms in the perturbation expansion of the S'-matrix. We will 
see that Feynman diagrams exist in several related forms, but the most 
straightforward of these are simply spacetime diagrams describing the 
trajectory of particles. We imagine time running up the page and rep¬ 
resent particles by lines with an arrow going in the direction of time 
[Fig. 19.1(a)]; antiparticles are represented by lines with arrows going in 
the opposite direction. (This is consistent with Feynman’s interpreta¬ 
tion of negative energy states as positive energy antiparticles travelling 
backwards in time.) We can draw particle and antiparticle pairs being 
created at some time [Fig. 19.1(b)] or destroyed [Fig. 19.1(c)] at some 
other time. We can also tell stories: the line in Fig. 19.1(d) may look 
like it describes a single particle looping backward in time, but Feynman 
has another interpretation. Just as it’s unclear to a bomber pilot flying 
over a curving road whether (s)he is above a single road or a number 
of them, we’re asked to fly over the diagram experiencing everything in 
time order. From the point of view of the bombardier, we start with 
a particle for t < £ 2 , then at t = ^ a new particle—antiparticle pair is 
created. The new particle and antiparticle coexist with the original par¬ 
ticle in the interval t 2 < t < < 3 . At t = £3 the newly created antiparticle 
collides with our original particle and annihilates. The newly created 
particle continues out of the diagram for t > <3 [see Fig. 19.1(e)]. Since 
the particles are identical, no-one can tell the difference. 

In this chapter we introduce the perturbation expansion of the S'- 
matrix and its representation in terms of Feynman diagrams. Recall 
from the previous chapter that an expression for the S-operator is given 



Fig. 19.1 The Feynman diagram, (a) 
A particle is represented by a line with 
an arrow going in the direction of time 
(conventionally up the page). An an¬ 
tiparticle is represented by a line with 
an arrow going the opposite way. (b) 
A particle and antiparticle created at 
time t = to . (c) A particle and antipar¬ 
ticle are annihilated at time t = t\. (d) 
Trajectory of a single particle can be 
interpreted as follows: (e) A particle- 
antiparticle pair are created at £ 2 , the 
antiparticle of which annihilates with a 
third particle at t = £ 3 . 
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1 Notice also that we integrate over all 
z in eqn 19.1 thus taking into account 
the possibility of the interaction taking 
place at every point in spacetime. 


o 

This has a very similar structure to 
the interaction used in QED. There the 
electron corresponds to the excitations 
in the psi-field and the photon to exci¬ 
tations in the phi-field. 


® ^ 5 ) 




Fig. 19.2 Spacetime diagrams rep¬ 
resenting some commonly encountered 
interactions. 


by Dyson’s expansion 


S = T 


e -i'f d*zHi(z) 


(19.1) 


To use this we must expand it as a power series in the operator 'Hj(z). 
We’ll now consider some of the forms of 'Hi(z) that are encountered in 
quantum field theory and see how the expansion of Dyson’s equation 
proceeds. The interaction Hamiltonians will generally involve the prod¬ 
ucts of free-held operators localized at some point 2 . We’ll see later 
that if we interpret the held operators as describing particles then the 
interaction 1 can be thought of as a collision of the particles at spacetime 
point z. 


19.1 Meet some interactions 

There are a number of models of interactions to try. For each inter¬ 
action we will draw a diagram in spacetime illustrating the interaction 
processes. This will eventually lead to the famous diagrams invented by 
Richard Feynman. At present we won’t make any interpretations of the 
diagrams in terms of particles: this will only be possible a little later 
when the held operators are made to act on states. 


Example 19.1 

(i) The simplest interaction Hamiltonian 'Hi(z) involves a scalar field 4>(z) interacting 
with an external source field J(z) at a spacetime point z: 

Hi(z) = J{z)<t>{z). (19.2) 

We draw the source J(z) as a blob at a point 2 . The field is drawn as a stumpy line 
attached to the source blob [see Fig. 19.2(a)]. 

(ii) The simplest self-interaction of fields is </> 4 (say ‘phi-fourth’ or ‘phi-four’) theory 
described by a Hamiltonian 

ni(z) = (19.3) 

This interaction causes four scalar cf >-fields to meet at a spacetime point z [see 
Fig. 19.2(b)]. The diagram shows four stumpy lines meeting at 2 . 

(iii) Another simple interaction is 

Wi(z) = flY’ t (z)«/’( 2 )</>( 2 ), (19.4) 

where <p(z) is, again, a scalar field and ip(z) is a complex scalar field. This interaction 2 
causes an adjoint psi-field [represented by ip^(z)], a psi-field [ip(z)] and a scalar phi- 
field [<£( 2 )] to meet at z. To keep track of fields and their adjoints we use the 
convention that a field 1/1 is shown by a stumpy line with an arrow pointing towards 
the interaction vertex [see Fig. 19.2(c)]. The adjoint field ip' is shown with an arrow 
pointing away from the interaction. 

(iv) Another very useful interaction is non-relativistic and describes the Coulomb 
interaction 

H\(x -y)= (x)^ (y)V (x - y)S(x° - y°)ip(y)ip(x). (19.5) 

This one is delocalized. It says that a psi-field and an adjoint psi-field meet at x. 
These interact, via a potential V (x — y), with a psi-field and adjoint psi-field at y. 
The 5-function ensures that the interaction is instantaneous (that is, occurs at the 
same time). [This is shown in Fig. 19.2(d).] 
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19.2 The example of 0 4 theory 

We’ll now carry out the full procedure of calculating an S-matrix element 
for the simple case of </> 4 theory. The Lagrangian density describing the 
interacting theory is given by 

£ = \ \d^{x)} 2 - ^y#c) 2 ^ (19.6) 

The free part is given by the first two terms Cq = — which 

give rise to a free Hamiltonian operator upon canonical quantization 

Wo = ^ (l^J + ( V ^) +™ 2 4’ 2 ■ (19.7) 

When we use the S-operator we work in the interaction picture, so the 
free Hamiltonian time evolves the field operators <t>(x). The states evolve 
via the interaction Hamiltonian which we read off from the interacting 
part of the Lagrangian C\ = — gy^C^) 4 as Wi = fr^*) 4 - Next we need a 
programme to expand the S-matrix using this interaction Hamiltonian. 

It’s a glorious five-step plan. 

Step I 

Decide what S-matrix element to calculate and write it as a vacuum 
expectation value (VEV) of the free vacuum |0). We do this so we can 
use the simple form of Wick’s theorem later. 

As an example we’ll take our ‘in’ state to be a single particle in a 
momentum state p and the ‘out’ state will be a single particle in a 
momentum state q. We’ll be calculating the amplitude 

A = ont (q\p) in = <g|%> 

= (2n) 3 {2E q )i(2E p )i(0\a q Sal,\0), (19.8) 

where we recall that the relativistic normalization of our states means 
that |p) = (27r)i (2£lp)5d^|0). 

Step II 

We expand the S'-operator using Dyson’s expansion 

S = T exp i J d^zTl^z)^ 

= T 1 — ij d 4 z7fi(;g) + ^ ^ J d 4 pd 4 w Th{y)Tli(w) + ■.■ (19.9) 

= T 1 - ^ J d 4 z<i>(z) 4 + J d 4 yd 4 w${y) 4 $(w) 4 +... . 
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Step III 

Plug the resulting expression for the S'-operator into the expression for 
the S'-matrix element that we’re trying to calculate 

-4 = (g|S|p) 

= (2n) 3 (2E q )H2E p )^f <0|a,at|0> + (^) J d 4 z (0\aj(z) 4 al,\0) 
+ Q) J d 4 yd 4 w (0\d q ^(y) 4 ${w) 4 dl,\0) + ... . 

(19.10) 

Since the amplitude is a sum of terms ordered by powers of A, it makes 
sense to label the terms A = At°) + + A ^ where A t ' n ' > is the 

term proportional to X n . 


Step IV 


3 

Note that contractions involving a- 
operators don’t need T symbols since 
here the d p -operators are understood 
to create particles at t = — oo only, 
while the dp-operators destroy particles 
at t = oo only, which fixes the time or¬ 
dering unambiguously. 


Use Wick’s theorem to grind down the terms. That is, for each term we 
write the string of operators bookended by the vacuum states as a sum 
of all of the contractions of pairs. 

Let’s first consider A^\ the first-order part of the expansion 
(the part proportional to A). Using Wick’s theorem on the string 
(0|dq<^(a;) 4 d^|0) = (0\a q $(z)4>(z)(j)(z)(j)(z)d^ p \0), will yield up two sorts 
of term. The first contains contractions of the a-operators and the </>- 
operators separately, like 3 


(0\a q j>(z)j>{z)4>(z)4)(z)dl\0) 


= (Q\a q al\d}(Q\T(t>(z)4){z)\0}(0\T4)(z)j){z)\0}. 


(19.11) 


There are two other combinations of contractions of the ^-fields that 
give this term, making three in total. 

The second type of term contracts a-operators with cj> operators. 
There are twelve ways of doing this. One example is 


\a q (t>(z)(t>{z)(/)(z)(j)(z)al\0} 

= (O|fiq^(2:)|O)(O|T<^(2)<^(2:)|O)(O|0(2)dp|O). 


(19.12) 


The result of this set of manipulations is that the first-order term in the 
expansion of the S'-matrix element for (q\S\p) is given by 


^Remember that for a scalar field 
</>( x) = (jy ( x ) and 


A (y 



d 4 q i e-^h/-*) 
(27 t) 4 q 2 — m 2 + ie ’ 


where q is a dummy momentum, over 
which we integrate. 


Al (1) =-^ J d 4 z 3(0|aqaJ,|0)(0|^(2;)^)|0)(0|^(2;)^(2:)|0) (19.13) 

+ 12(O|dq^(i:)|O)(O|T^(2:)^(2)|O)(O|0(x)aJ > |O) . 

Here are some rules for making sense of the contractions: 

• Contractions between two fields make a free propagator 4 for that 
field, that is, (j>(y)(j>(z) = (0\T(j>(y)(j>(z)\0) = A (y - z). 
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• The contraction between a field and the creation operator 
from the initial particle state corresponds to a factor 

i _ l g—i p-z 

(2tt)2 (2E p )i 


Example 19.2 

To see this, we use the expansion on the field <f>(z): 


(0 o(z)(T p 0 ) = 


I 

I 

! 


d 3 q 


1 


(27t)2 (2 E q ) 2 
d 3 g 1 


i q e~ iqz + a\e lqz 


(a q e~ iqz + a\e lqz ) 


(2?r)2 (2Eq)2 

-^r^T-e-^W^-p) 
(2tt)2 (2 E q ) 2 

! 1 e~ ipz . 


(2?r) 2 (2E P ) 2 


4i°> 

|p> 


(19.14) 


Notice that, following our conventions, the factor e _lp ' z cor¬ 
responds to an incoming particle. Notice also that the fac¬ 
tors —4 1 | from the contraction exactly cancel against the 

( 2 tt )§ ( 2 E p )h 

factors from the relativistic normalization of the states | p) = 
(27r) 2 (2 E p ) 2 , with the result that the contraction $(x) \p) = e~' p ' x . 


• Similarly, the con trac tion between a final state annihilation oper¬ 
ator and a field a a d)(z) corresponds to a factor ——— r e lq ' z . 

. (2vt)2 (2E q )z 

This is the outgoing particle. 

i—i 

• Contractions between initial and final particles a q a^ p = (0|aqa^|0) 
yield a delta function S^ 3 \q — p). 


Example 19.3 

As an example of the use of the rules, consider the term: 

I2{0\a q ct>(z)cf>(z)j>(z)4>(z)dl\0). 
Using the rules, this is represented by 

12 


1 1 

’ r d 4 fc e -ik (z-z) 

1 ^ c~ ipz 

( 2 tt )2 (2 E q ) 2 

J (2 rr) 4 k 2 — m 2 + ie 

_( 27 r )2 (2E p ) 2 


(19.15) 


Remember that to compute the contribution of this term to the S'-matrix element 
we will need to integrate the variable z over all space. That is, on including the 
normalization factors, we have that A , which is what we’ll call the contribution to 
the amplitude \p) from the above contraction, is given by 


A = 12 x (2n) 6 (2E q )2 (2E P )2 


1 ( —iA) r 
4! J 


d*Z 


1 


1 


( 2 tt )2 (2E q ) 2 


d 4 fc 


-ik- (z — z) 


(27r) 4 k 2 - m 2 + ie ( 27r ) I ( 2 E p )2 


(-iA) 


I 


d 4 2 - 


d 4 fc 


k 2 - , 


(19.16) 
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Notice that the normalization factors of (2ir) and (2 E p ) have cancelled. This was one 
intention of the conventions for decorating the earlier equation with these factors. 


We could, at this point, collect the terms together and prepare to in¬ 
tegrate the position of the interactions over all spacetime. However, 
there’s still one more part of the plan to carry out. 


Step V 

Make sense of a term by drawing a Feynman diagram. A Feynman di¬ 
agram represents an amplitude in the expansion of the S'-matrix. A 
particular term in the expansion will comprise a certain number of in¬ 
teractions. This number is the same as the order of expansion of the 
S'-operator. A second-order term has two interactions, a third-order 
three, etc. To represent the interaction in a diagram we draw the inter¬ 
action vertices exactly as shown in Fig. 19.2. Each line segment (we will 
call these legs) emerging from an interaction vertex represents an uncon¬ 
tracted field operator ej)(z). The Wick contractions (that we carried out 
in step IV) join the legs of the interaction to each other [for contractions 

like (f>(x)(j)(y)\ or to external particles (for contractions like cj)(x)a ) p ). 
Here is a list of rules for drawing a diagram. 



Fig. 19.3 Steps in drawing a diagram. 


• Draw the interaction vertices and label them with their spacetime 
coordinates [Fig. 19.3(a)]. 

• Contractions between the initial state and a field, i.e. (j)(x)a^ pl are 
drawn as incoming lines connecting to one of the legs of the vertex 
[Fig. 19.3(b)]. This corresponds to a real, on-mass-shell particle 
coming into the story. 

i-1 

• Propagators resulting from the field-field contractions (f>(x)(j)(y) 
are drawn as lines linking the points [Fig. 19.3(c)]. We can think 
of these as virtual particles which are internal to the story the 
diagram is telling. 

• Contractions between the final state and a field a q 4>(x) are drawn 
as an outgoing lines [Fig. 19.3(d)]. These correspond to on-mass- 
shell outgoing particles. 


Example 19.4 

Let’s again consider the term 

J d 4 z(0\d p j>(z)4>(z)4>(z)j)(z)al\0). (19.17) 

The process of drawing a diagram is shown in Fig. 19.3. 

• It’s a first-order diagram (it contains one copy of 7ii) so we draw one interac¬ 
tion vertex at position 2 : [Fig. 19.3(a)]. 
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• The contraction (j>(z)dp is represented by an incoming line that grabs one of 
the vertex legs [Fig. 19.3(b)]. 

• The contraction <b(z)<p(z) ties together two vertex legs [Fig. 19.3(c)]. 

• The contraction a q (f>(z) grabs the remaining vertex stub and forms a line 
leaving the diagram [Fig. 19.3(d)], 


The diagrams that result from our Wick expansion of the S'-matrix to 
first order are shown in Fig. 19.4. The expression for eqn 19.11 is shown 
in Fig. 19.4(a), that for eqn 19.12 is shown in Fig. 19.4(b). 


19.3 Anatomy of a diagram 

The diagrams we’ve drawn so far could correspond to stories describing 
the interaction of particles if you think of time running upwards. A 
contraction gives you a full diagram. This may be made of many pieces. 
Figure 19.4(a) is a diagram made of two pieces, while Fig. 19.4(b) is a 
diagram made of a single piece. We call a piece of a diagram a con¬ 
nected diagram. Figure 19.4(a) is a disconnected diagram made up of 
two connected diagrams. Disconnected processes cannot influence each 
other (because they are not connected to each other!) and so physical 
intuition tells us that we will probably only have to consider connected 
diagrams (that expectation will turn out to be right on the money). 5 
There’s more vocabulary to be learnt in order to talk diagrams with the 
professionals: 

• External lines have one end which appears not to be connected 
to anything. (Actually, they indicate a connection with the world 
exterior to the physical process.) Figure 19.4(b) or the left-hand 
piece of Fig. 19.4(a) are connected diagrams with external lines. 

• A vacuum diagram has no external lines. The right-hand piece of 
Fig. 19.4(a) is a (connected) vacuum diagram. Vacuum diagrams 
don’t affect transition probabilities because they are not connected 
to the incoming or outgoing particles; they only affect (0|5|0) (and 
therefore only contribute a phase e 1 ^ to transition amplitudes). 

A particular connected diagram might contain: 

• Vertices where lines join together. These represent interactions. 

• External incoming lines (we draw these below the vertices). They 
represent on-mass-shell particles entering the process. 

• External outgoing lines (we draw these above the vertices). They 
represent on-mass-slrell particles leaving the process. 

• Internal lines (joining two vertices). These represent virtual par¬ 
ticles which are off-mass-shell and therefore exist internally within 
the diagram. 



Fig. 19.4 First-order contributions in 
</> 4 theory, (a) A disconnected diagram 
with two connected parts, (b) Con¬ 
nected diagram with one loop. 

5 

The cluster decomposition prin¬ 
ciple guarantees that distant experi¬ 
ments yield uncorrelated results. An 
5-matrix constructed out of creation 
and annihilation operators turns out 
automatically to satisfy this principle, 
which therefore provides the deep rea¬ 
son why these operators are required in 
quantum field theory, rather than just 
being a convenience. The cluster de¬ 
composition principle may be justified 
for connected and disconnected Feyn¬ 
man diagrams using the linked cluster 
theorem, discussed in Chapter 22. See 
Weinberg, Chapter 4 for a detailed dis¬ 
cussion. 


Outgoing 

external 



Incoming 

external 

line 


Fig. 19.5 The anatomy of a discon¬ 
nected third-order diagram. 
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Remember that these are the rules for 
<p 4 theory. Other theories will have 
slightly different rules. 


These various properties are illustrated in Fig. 19.5. 

Why do we draw Feynman diagrams? Simple: knowing how the di¬ 
agrams relate to the contractions means that we can simply draw dia¬ 
grams for a particular interaction and, instead of going through all of the 
leg work of doing an expansion, we just write down the equation to which 
a diagram corresponds. If someone were now just to give us a Feynman 
diagram, how could we translate it into a term in the expansion? Here 
are the rules: 

Feynman rules for <j > 4 theory in position space 

To calculate an amplitude in the S-matrix expansion, translate a 
Feynman diagram into equations as follows: 

• Each vertex contributes a factor — iA. 

• Each line gives a propagation factor A (x — y), where x and y are 
the start and end points of the line. 

• External lines contribute incoming (—ip • x) or outgoing (+i p ■ x) 
waves e ±ip x . 

• Integrate the positions of the vertices over all spacetime. 

• In order to get the right coefficient in front of the term divide by 
the symmetry factor D. 

This last point is treated in the following section. 


19.4 Symmetry factors 


6 Often the symmetry factors aren’t re¬ 
quired to understand the physics of 
what a diagram is telling you, but when 
combining several diagrams it’s useful 
to know them. 


A potentially tricky thing is how to work out what number D we divide 
by in working out our Feynman diagram amplitude. The number arises 
from the number of ways there are to produce a certain diagram through 
contractions, divided by 4! (In fact the reason for the 4! is to simplify 
this number as much as possible.) Rather than deriving them we’ll 
quote the result which the interested (and/or masochistic) reader can 
easily justify by working through the combinatorics. 6 The general rule 
is as follows: if there are m ways of arranging vertices and propagators 
to give identical parts of a diagram (keeping the outer ends of external 
lines fixed and without cutting propagator lines) we get a factor = m. 
The symmetry factor is given by the product of all symmetry factors 

D = Yl t D i- 

Two very useful special cases, which aren’t immediately apparent from 
the general rule, are as follows: 


• Every propagator with two ends joined to one vertex (a loop) gives 
a factor Di = 2. 

• Every pair of vertices directly joined by n propagators gives a 
factor Di = n\ 




19.5 


Calculations in p-space 183 


Example 19.5 

Figure 19.6 shows several diagrams to which we can apply the rules. 

• (a) The bubble propagator has D = 2 since it contains a propagator with two 
ends joined to a vertex. Remember you are not allowed to move the ends of 
external lines, so there are no more factors to worry about. 

• (b) The two-bubble propagator has D = 4. Each bubble gives a factor of 
2, so D = 2 x 2. The bubble propagators in (a) and (b) are examples of 
self-energy diagrams. They are basically free propagators with extra loops 
in, but don’t interact with anything else. We will see later that these extra 
loops just change the constants in the free propagator, or to use the lingo of 
quantum field theory, they ‘renormalize the free propagator’. 

• (c) The double-bubble vacuum graph has two bubbles (or loops), each con¬ 
tributing a factor 2, but the bubbles can be swapped over (rotating the diagram 
180° about a vertical axis passing through the middle of it), and we get the 
same diagram, giving us an extra factor m = 2. Therefore, we have D = 8. 

• The Saturn diagram (d) has a pair of vertices joined by three lines, so D = 
3! = 6. 

• Diagram (e) has a factor of 2 from the bubble. It also has two vertices joined 
by two lines, contributing a factor 2!, so D = 2 x 2! = 4. 

• Diagram (f) has three vertices. We think of these as being grouped in two 
pairs (with the middle vertex counted twice). Each pair is joined together 
by two lines (and so each contributes a factor 2!). In total we therefore have 
D = 2 x 2! = 4. 



• Diagram (g) has a bubble, so D = 2. 

• Diagram (h) has two bubbles but the top and bottom parts between the ver¬ 
tices can be swapped giving an extra factor of 2, sofi = 2x2x2 = 8. 

• Diagram (i) has a pair of vertices joined by two lines so D = 2. 

Other theories will have slightly different rules for their symmetry factors. 


The derivation of the Feynman rules represents the point where we can 
start calculating scattering amplitudes for physical processes. A large 
part of formulating a useful quantum field theory may be reduced to 
the process of deriving the Feynman rules following the path shown in 
Fig. 19.7. 


19.5 Calculations in jd-space 

Calculations turn out to be far easier in momentum space, as illustrated 
by the following example: the second-order ‘Saturn diagram’. 


Example 19.6 

Let’s see how the Saturn diagram arises. Consider the second-order Of A 2 ) term in 
the expansion of (gISjp) (i.e. two copies of Tii^x) in the expansion), 

(-i) 2 


s (2) = 


ff 


2 ! 

(~iA) 2 f 
2!(4!) 2 J 


d 4 yd 4 ui r Hi(y)'Hi(z) 

d 4 yd 4 w 4>(y)4>(y)(j>(y)<j>{y)4>{w)4>{w)4>{w)4>{w). (19.18) 


Fig. 19.6 Examples of diagrams in </> 4 
theory. 



Fig. 19.7 The process of deriving the 
Feynman rules. 
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Fig. 19.8 

diagrams. 



|*2 if *3 


Fig. 19.9 

turn space 


(a) Position and (b) momen- 
Feynman diagrams. 


We now apply Wick’s theorem and obtain the diagrams shown in Fig. 19.8. The one 
in which we’re interested is Fig. 19.8(f), which comes from the term 


(q\S^\p) = (2n) 3 (2E q )l(2E p )i 


/ iA ')2 |—rn . 1 1 , 1—1 „ 1 „ 1 1 - 1 

x —=p- {0\a q 4>(y)4>(y)4>(y)4>(y)4>(w)4>(w)4>(w)4>(w)al |0>, 


(19.19) 


where D = 6 is the symmetry factor here. The amplitude for the Saturn Feynman 
diagram in Fig. 19.9(a) is given by 

A 2 r 

d 4 yd 4 we lq y A{y - w)A(y - w)A(y - w)e~ ip w . (19.20) 


Each spacetime propagator A gives us a factor of 


All of the second-order 0 4 


A (y -w) = J 


Q K c -i k (y-w) 1 

(27r) 4 k 2 — m 2 + ie 


(19.21) 


So, upon collecting all of the exponentials together, we have 


Saturn = — 


A 2 f d 4 /ci d 4 k 2 d 4 k 3 


6 J (27t) 4 (27t) 4 (27r) 4 (k 2 — m 2 + ie) ( k 2 — m 2 + ie) (fc§ — m 2 + ie) 
x/ dV.(^»..»...)(.<«-«.^.). 

Next we notice that some of the integrals will give us delta functions, which simplify 
things considerably. Integrating y over all space gives us an additional factor of 
/d 4 ye- i ( fc i+ fc 2+ fe 3-9)-a = (2 tt) 4 <5( 4 )(fc, + fc 2 + k 3 - q) and we have 

A 2 [ d 4 fci d 4 fc 2 d 4 fc 3 i i i 

6 J (27r) 4 (27 t) 4 (27r) 4 (fc 2 — m 2 + ie) (k 2 — m 2 + ie) ( k 2 — m 2 + ie) 

X J d 4 «> ( e i ( k i+ k 2+k3)-w(2n) 4 S W (k 1 + k 2 + k 3 - q), 

which means that we may set k\ = q — k^ — k-$ upon doing the k\ momentum 
integral. We see that the integral has given us a constraint on the allowed values of 
the momenta. Translated into real life, this means that the sum of four-momenta 
at each vertex is zero. Energy-momentum is therefore conserved at each interaction 
vertex. So now we have 


Saturn = — 


A 2 f d 4 k 2 d 4 fc 3 


6 J (27r) 4 (27r) 4 [(q — k 2 — k 3 ) 2 — m. 2 + ie] (k 2 — ni 2 + ie) (fc 2 — m 2 + ie) 

x J d 4 w (e i «" u > e - i P- w ') . 

The w integration gives f d 4 we 1 ( q ~ p )" w = (2n) 4 8^ 4 \q — p). The overall energy- 
momentum is conserved and the entire diagram carries an overall energy-momentum 
conserving delta function around with it. We’re left with two momentum integrals 
to do. Finally we arrive at the momentum space result: 


— A 2 r 

Saturn = ———(27r) 4 (^ 4 ) (q — p) / 


d 4 k 2 d 4 k 3 i 

(27r) 4 (27 t) 4 [(q — k 2 — k 3 ) 2 — m 2 + ie] 


(k 2 — m 2 + ie) (A: 2 — m 2 + ie) 


. (19.22) 


Although we’ve ended up with the same number of integrals as with the position 
space version, we’ve expunged all of the exponential factors. In addition, since the 
energy-momentum conserving delta function is common to all diagrams, we often 
choose not to carry it around, but to understand that energy-momentum is im¬ 
plicitly conserved (and reinstate the factor when it’s needed in the mathematics!). 
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We may now write down a set of Feynman rules for </> 4 theory in mo¬ 
mentum space: 

Feynman rules for (i > 4 theory in momentum space 

• Each vertex contributes a factor —iA [Fig. 19.10(a)]. 

• Label each internal line with a momentum q flowing along it and 

describe it by a propagator . 2 Fig- 19.10(b)]. 

• Force the sum of each momentum coming into a vertex to be 
equal to the momentum leaving it. 

• Integrate over unconstrained internal momenta with a measure 

d 4 g 
(2tt) 4 ’ 

• External lines contribute a factor 1 [Fig. 19.10(c)]. 

• Divide by the symmetry factor. 

• Include an overall energy-momentum conserving delta function 
for each diagram. 

Remember that the external lines don’t contribute a propagator. In 

momentum space the external line parts are just a p (f>(x) = 1. Remember 
also that you only need to integrate over unconstrained momenta. That 
is, only integrate over momenta which are not already determined by the 
conditions that momentum is conserved both at the vertices and overall 
for a diagram. (If in doubt you can include a delta function for each 
vertex and integrate over all momenta.) 



= —iA 



Fig. 19.10 Momentum space Feyn¬ 
man rules for </> 4 theory. 


Example 19.7 

Let’s now apply our rules to some example Feynman diagrams. Some terms in the 
expansion of (^|5|p) up to second order are shown in Fig. 19.11. Applying the rules 
to some of these yield the following for their contribution to (<?|,S'|p): 


(b) = 

(2tt) 4 <5 (4) (g-p) 
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(2tt) 4 5 (4) (q-p) 
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In summary: the amplitude A = out (q\p)' n = (q\S\p) may be written 
as a sum of diagrams. Each diagram stands for an integral. We call 


Fig. 19.11 Examples of Feynman di¬ 
agrams for (j > 4 theory, up to second or¬ 
der in the interaction strength, for the 
5-matrix element (< 3 , | 1 S'|p). 
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diagrams which describe how interactions affect the amplitudes of single 
particles self-energy diagrams. 

We haven’t yet discussed how to finally do the integral to finally get 
the numbers out. This is because often the integrals give us divergent 
(i.e. infinite) results! Taming these fearsome infinities reveals a huge 
amount about the physics lying behind quantum field theory and we 
discuss that in Chapter 32. Some diagrams don’t diverge, of course, and 
we examine some of these in Chapter 20. For now, we need some more 
experience getting used to how the expansions go. 

19.6 A first look at scattering 

We have looked at diagrams describing single particles. What about a 
process of two particles entering and then leaving? The amplitude we’re 
after is 

(qiQ 2 \S\p 2 Pi) = {2Tr) 6 (16E Pi E P2 E qi E q2 )^(0\a qi a q2 Sal 2 al i \0}, 

(19.23) 

describing particles entering in momentum states p\ and p 2 and leaving 
in states q\ and qi- We could of course now start the process again and 
work through all of the contractions. However, this is now unnecessary 
since we have Feynman diagrams for the theory. 


Fig. 19.12 Examples of Feynman 
diagrams for the 5-matrix element 
(QiQ 2 \S\p 2 Pi) up to second order in the 
interaction strength. 


Example 19.8 

Some diagrams contributing to the matrix element (^i< 3 , 2 |*S , |p 2 Pi) are shown in 
Fig. 19.12. The first two yield up 

Diagram (a) = (2 tt) 6 (16E Pl E P2 E qi E q2 )i <5 (3) ( qi - p,) <5 (3 ) (<j 2 - p 2 ), 
Diagram (b) = (2ir) 6 (lGE Pl E P2 E qi E q2 )i S ( - 3 '>(q 1 - p 2 ) <5 (3) (q 2 - p,). 
These clearly aren’t very interesting since no interactions occur. They just arise 
from the zeroth-order part of the 5-operator, = 1. They don’t contribute to 
scattering, and therefore aren’t measured in an experiment. We’ll ignore them. 
Some more interesting examples of amplitudes are the following: 

Diagram (c) = (2n) 4 S( 4 \qi + <?2 — pi — P 2 ){~ iA), 

/_ i^\2 

Diagram (f) = (27r) 4 <5 (4) (qi + 92 - Pi - P 2 )^-^- 

r d 4 fc i i 

J (27r) 4 [k 2 — m 2 + ie] [(pi + P 2 — k ) 2 — m 2 + ie] 


Chapter summary 

• The S'-matrix may be expanded in a five-step process. 

• The terms in the expansion can be encoded in Feynman diagrams. 

• It’s easiest to work in momentum space. 
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Exercises 


(19.1) Write down the momentum space amplitudes for 
the processes shown in the Feynman diagrams in 
Fig. 19.6. 

(19.2) (a) Draw the interaction vertex for 0 3 theory, 
which is a scalar field theory with Lagrangian C = 
±(<VA) 2 -^ 2 -^ 3 . 

(b) By expanding the S'-matrix find the contribu¬ 
tions to the amplitude (g|S'|p} up to second order 
in the interaction strength. Draw the correspond¬ 
ing Feynman diagrams. What are the symmetry 
factors? 

(c) Draw the new, connected diagrams that con¬ 
tribute to (g|S'|p) when you consider the fourth- 
order contribution. 

(d) Which connected diagrams contribute to the 
two-particle S'-matrix element (qiq 2 \S\p 2 Pi) at 
fourth order in the interaction. 

For simplicity, just draw topologically distinct dia¬ 
grams, ignoring new ones produced by permutations 
of external lines. 

(19.3) Consider the ABA theory, defined by the La¬ 
grangian 


£ = 


(a) Draw the interaction vertex using different 
styles of line for the different fields. 

(b) Draw the Feynman diagrams that contribute to 
the scattering amplitude (qa|<5|pa) up to fourth or¬ 
der (here pa refers to an A-particle in momentum 
state p). Write expressions for the amplitudes for 
these diagrams. You may ignore symmetry factors. 

(c) Draw the Feynman diagrams that contribute to 
the scattering (qb|S|pb) up to fourth order in the 
interaction. Write expressions for the amplitudes 
for these diagrams. Again, you may ignore symme¬ 
try factors. 

(d) Draw Feynman diagrams that contribute to 
the A-particle scattering (qAiqA 2 \S\pA 2 PAi) up to 
fourth order. 

(e) What are the rules for the symmetry factors of 
this theory? 

(19.4) (a) Show that the amplitude for the double bubble 
diagram [Fig. 19.6(c)] is given by 


A = 


—iA 

~8~ 


f d 4 p 

J (Onr\* 


(2-7t) 4 p 2 — m 2 + it 


( 5 (4 - ) (* = 0 ). 


(19.24) 

A^^^b) 2 -2^b — f ^ A< ^ B ^ A ' 

This theory describes the interaction of two scalar 
fields. 


(19.25) 

Here the <^ 4 ) = o) factor gives us a quantity pro¬ 

portional to the volume of the system V multiplied 
by the total time T. This factor is infinite, but we 
will show later that this will not worry us. 

(b) Argue that this factor arises for all vacuum di¬ 
agrams. 
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And the LORD shall cause his glorious voice to be heard, and 
shall shew the lighting down of his arm, ... with scattering, 
and tempest, and hailstones. 

Isaiah 30:30 

One important application of quantum field theory is the calculation of 
scattering cross-sections. It is important since scattering cross-sections 
are measured in many experiments. In this chapter we examine scat¬ 
tering using Hideki Yukawa’s i theory, which is an illuminating toy 
model describing the interactions of scalar fields. The joy of this model is 
that it bears a strong resemblance to quantum electrodynamics (QED), 
which describes electrodynamics to an astounding degree of accuracy. 
Specifically, in theory the phion excitations in the 0-field take the 

role of (massive, scalar) photons and the psion excitations in the 0-field 
describe (complex scalar, bosonic) electrons. 


20.1 Another theory: Yukawa’s 
interactions 



Fig. 20.1 The interaction ver¬ 

tex. 


The theory describes a complex scalar field and a real scalar field </> 
interacting. The full Lagrangian for this theory is 

C = - m 2 V’ t V’ + ^(A^) 2 - \h 2( t> 2 ~ ( 20 . 1 ) 


Here psions have mass m and phions have mass p. This Lagrangian is 
the sum of the free scalar field and free complex scalar fields Lagrangian 
with an interaction part, shown in Fig. 20.1, given by C\ = —g^'ip4>. 
We start by writing down the mode expansions of each of the free fields: 


ip{x) 
$ (x) 
<i> (x) 


d 3 p 

(20i 

d 3 p 


1 

(2 E p )\ 

1 


(2tt) 5 
d 3 q 


(2 E p )l 

1 


( 27 t )2 ( 2e q 


(a p e~ ip x + b ^ px ) , 
(a* e ip ' x + b p e~ ipx ^ , 
(c q e~^ x + c\e^ x ), 


( 20 . 2 ) 


where E p = ( p 2 + to 2 )! and e q = (q 2 + g 2 )^ ■ Here the a-operators 
describe the creation and annihilation of psions, the 6-operators de¬ 
scribe the creation and annihilation of antipsions and the c-operators 
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create and destroy scalar phions. The interaction Hamiltonian to use 1 Remember that Hi = -t\ so L\ 

in Dyson’s equation is 1 'Hi(z) = gift (z)il>(z)<j>(z), whose interaction dia- implies Hi = gfo■$(/>. 

gram is shown in Fig. 20.1. We decorate the interaction diagram with an 

arrow pointing toward the interaction blob for the t/^-field and an arrow 

pointing away for the -field. The meaning of these arrows will be to 

show particle number flow, as discussed a little later. 

To calculate S'-matrix elements we’ll work through our five-point plan. 

The most important new feature of this theory is that now we have the 
possibility of antiparticles in our interactions. 

Step I: We decide what to calculate. The one psion in, one psion out 
amplitude is 


A = (q\S\p) = (27r) 3 (2E q ) 1 H2E p ) 1 2(0\d q Sal,\0). (20.3) 


Step II: We expand the A-operator using Dyson’s expansion 

S = 1 + (—ip) f d 4 zft(z)$(z)4>(z) 

, (“iff) 2 


(20.4) 


d 4 yd 4 w 


2 ! 


W {y)^{y)<P{y)\ \i>'(w)i>(w)<j>(w) 


Step III: The next stage is to plug in to get the amplitudes for the 
various processes we’re going to calculate. All first-order terms will turn 
out to give zero (try it!) so we’ll consider the second-order term 

A (2) = ^ 2 -( 2 Tr) 3 ( 2 E p )h( 2 E q )h (Oldg^(y)ip(y)(j)(y)ip jl (w)ip{w)(i)(w)d p \0}. 

(20.5) 

Step IV: We use Wick to digest the matrix elements. Here are the rules 
for the nonzero contractions for this theory: 


%p(x)ft{y) = / (f/-- 


-ip(x-y) 


) 4 p 2 -m 2 + ie ’ 


I TH 

d p ^{x) = 


(2 tt )2 ( 2 E P ) 2 
1 


b p ip(x) = -J 

p (2 tt )2 ( 2 E p )2 

c q 4>(x) = 


1_ 


( 2 tt )2 ( 2 e ,)2 


4>(x}ky) = f (§r~- 

i>(x)a\, = 


-iq(x-y) 


I I 

$K X )% = 

s iqx , <j>{x)c\ = 


J (2 

n) 4 q 2 -g 2 +ie ’ 

1 

1 „-ip-x 

(2tt)§ 

1 1 
( 2£ P )5 

1 

1 „-i p-x 

( 2 tt )5 

(2 E p )2 

1 

3 

1 e - '"® 


(2tt) 2 (2 £q )2 


( 20 . 6 ) 

All other contractions give zero. Finally, we’ll agree to work in momen¬ 
tum space and note that the symmetry factor for all diagrams in r tp^'tfxj) 
theory is D = 1. 


Example 20.1 

Using these contractions let’s see what sort of diagrams are thrown up. Consider the 
second-order term with the following contraction 


(0\a q ^ly)ik{y)4>(y)^(w)4>(w)^w)al,\0). 


(20.7) 
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2 This name was coined by Sidney Cole¬ 
man. When the journal Physical Re¬ 
view objected to the name, Coleman 
suggested the alternative ‘Spermion’. 
Physical Review relented. 


This leads to the Feynman diagram shown in Fig. 20.2(a), which is known as a 
tadpole. 2 In a theory with no particles in the ground state this diagram gives zero. 
Consider another set of contractions: 


(0|a„ ft {y)i>{y)<i>{y)& ( w)i>(w) 4 >(w)al |0>. (20.8) 



Fig. 20.2 (a) A tadpole diagram, (b) 
An oyster diagram. 


H> 




Fig. 20.3 Disconnected diagrams 
making second-order contributions. 


This leads to the Feynman diagram shown in Fig. 20.2(b), known as an oyster, which 
when translated into equations gives an amplitude .Aoyster: 

( -i9 ) 2 f ATA 712- 1 2 1 • N 77- rp -r T (27r) 4 <5 (4) (g - p). (20.9) 

J (27r) 4 (k- 1 — y- 1 + le) [(p — k) z — m* + lej 

There are also two disconnected contributions involving vacuum diagrams, shown in 
Fig. 20.3. 


There are a few new things to note about the Feynman diagrams in this 
theory. The first is that we include arrows on the psion and antipsion 
lines of our Feynman diagrams, but not the phi lines. Psions have arrows 
going in the direction of ‘time’ (that is, up the page), antipsions have 
arrows going against the ‘time’ direction. These arrows don’t represent 
the direction of momentum. Actually arrows on the lines represent the 
(conventional) direction of Noether current Jn c (be. conserved particle 
number flow). Motivated by the expression Qnc = /d 3 p(hp^ — kip' 1 ) 
we say that incoming particles increase particle number and correspond 
to lines inwards on the diagram whilst incoming antiparticles reduce 
particle number resulting in outward going lines in the diagram. To 
avoid confusion, it sometimes helps to draw extra lines showing the 
directions of momenta (see, e.g. Fig. 20.6). Notice that this implies that 
an antiparticle has momentum in the opposite direction to its number 
flow. 


20.2 Scattering in the theory 

So far we’ve only considered one particle coming in and one particle leav¬ 
ing. Now let’s consider the process of two psions coming in, scattering, 
then two psions leaving. The S'-matrix element we’re after is 

(QlQ2\S\p2Pl) = (01Oq 1 Oq 2 S aj, 2 dj^ 10). (20.10) 

It’s fairly obvious that all first-order terms are zero (since it’s impos¬ 
sible to draw two-in two-out Feynman diagrams with only one of our 
interaction vertices). We’ll consider the second-order terms. 


Example 20.2 

From the contraction 


(0|a„ 


2 i’Hy)${y)<i>{y)'4’Hw)'i[>(w)4>(w)a t 


IpJO), 


( 20 . 11 ) 
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we have the diagram shown in Fig. 20.4(a). Two psions enter, one emits a force¬ 
carrying, virtual phion which collides with the second psion. The two psions then Stanley Mandelstam (1928- ) 
leave. This is known in particles physics as a ^-channel process (following notation 
introduced by Stanley Mandelstam in 1958 in which the three possible processes were 
assigned the letters ‘s’, ‘i’ and ‘ u ’). 

A closely related process arises from the following contraction 



( 0 | a qi a q2 $^y)$(y)4>(y)fl(w)'4>(w)j>{w)a'l 2 al >i | 0 ), ( 20 . 12 ) 

represented by the diagram in Fig. 20.4(b) and is known as a u-channel process. It’s 
the same as the ^-channel process, except that the initial and final particles have 
changed places after the interaction. To obtain the amplitude ( qiq 2 \S\p 2 Pi ) for the 
scattering of indistinguishable particles at second order add the amplitudes from the 
t- and u-channel processes. 

What about the following process describing psions interacting with antipsions 
(denoted |p))? The expression 

(5i92|S’|p2Pi> = (0|6q i a <Z2 S'a+, 2 6+, i |0> (20.13) 

describes a psion and antipsion entering, interacting, then leaving. We can still have 
the contraction which leads to a process analogous to Fig. 20.4(a), although this will 
involve antipsion lines with arrows pointing against the direction of time (i.e. down 
the page) attached to one vertex. In addition, another interesting new contraction 
possibility occurs here. Look at 



(0\bq 1 a q2 i>Hy)4>(y)<j>(y)i>Hw)’ip(w)<i>(w)al 2 bl i \0), (20.14) 

which corresponds to the diagram in Fig. 20.4(c). This depicts a psion and an 
antipsion coming in and annihilating. They become a virtual phion which decays 
into a psion and antipsion. This is known in particle physics as an s-channel process. 

Note that the //-channel process now involves the exchange of the particle and an¬ 
tiparticle, so becomes distinguishable from the t- and s-channel processes. It there¬ 
fore contributes to (qiq 2 \S\p 2 pi), rather than (qiq 2 \S\p 2 pi)■ To obtain the amplitude 
(?l'?2|S , |p2Pl) at ' second order we therefore need to add the amplitudes from the t- 
and s-channel processes only. 



The idea behind all of this exploration is to write down a set of Feynman 
rules for this theory, which we now do. 

Feynman rules for theory 

• Each vertex contributes a factor —i g [Fig. 20.5(a)]. 

• For each phion internal line carrying momentum q include a prop¬ 
agator g ^| 1 2 + g; [Fig. 20.5(b)]. For a psion internal line include 
a propagator g2 _^ 2+ie . 

• Integrate over all undetermined momenta. 

• Incoming and outgoing lines contribute a factor 1 [Fig. 20.5(c-d)]. 

• All symmetry factors are 1. 

• Include an overall energy-momentum conserving delta function 
for each diagram. 



Fig. 20.4 (a) A t-channel scattering 

process, (b) A u-channel process, (c) 
An s-channel process involving parti¬ 
cles scattering from antiparticles. 



Fig. 20.5 Feynman rules for the¬ 

ory. 
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(c) 

\q = p + p' 



Fig. 20.6 The t-, u- and s- 
channel scattering processes in momen¬ 
tum space. 


o 

The factor of i is included so that the 
amplitudes match up with the results 
from the conventions of non-relativistic 
scattering theory. 


^To remove them, we define the act of 
amputation, which involves chopping 
off all such carbuncles on the external 
legs. 


Example 20.3 

The amplitude for the second-order scattering diagrams (including the overall energy- 
momentum conserving delta function) is 


A^ = (-ig) 2 - 


; (2tt) 4 5 (4) (^2 Pi ~ Pi ) ' (20.15) 


q 2 — p 2 + ie ' 

Here we don’t integrate over q, as it is completely determined by momentum conser¬ 
vation for these diagrams. Referring to Fig. 20.6 we have the following amplitudes: 

• For the ^-channel process q 2 = (p r — p) 2 = t and we have 


At = (—i g) 2 


- (2tt) 4 (p' + k' — p — k ). 


t — p 2 + ie ' 

• For the u-channel process we have q 2 = (p' — k) 2 = u and 


A u = (-i gf 


u — p 2 + ie 


(27r) 4 5^ 4 ^ (p' + k' — p — k ). 


For the s-channel process we have q 2 = (p + p') 2 = s and 


As = (-ifl) 2 


S — ILL* 16 


— (27r) 4 <5^ 4 ) (k + k' — p — p 1 ). 


(20.16) 


(20.17) 


(20.18) 


20.3 The transition matrix and the 
invariant amplitude 

One thing that one immediately notices about the expansion of the S- 
operator is that the first term is unity. This means that the only nonzero 
amplitude from this term in the expansion has identical initial and final 
states. This is a rather dull result as far as scattering is involved (since 
nothing happens) so it’s commonly removed by writing S = 1+iT, where 
the matrix elements of T are often called the transition or T-matrix. 
Another irritation is that diagrams carry the overall energy-momentum 
conserving delta function. This is often factored out to make the so- 
called invariant amplitude Ai, defined for a two-in, two-out process as 

(pifP2f|iT>2iPii) = (2 tt) 4 <5 (4) (pu + p 2 { -p2i~Pu)iM, (20.19) 

which at least prevents us from having to continuously write down the 
delta function. 3 

We’re now ready to reveal a wonderful simplification. Remember that 
diagrams fall into two classes: connected diagrams and disconnected di¬ 
agrams. It turns out that only fully connected diagrams contribute to the 
T-matrix , a point which makes good physical sense since disconnected 
diagrams correspond to physical processes which do not influence each 
other. We also note that there are a class of diagrams which have loops 
attached to the external legs. These are bad news and often lead to 
unpleasant and non-trivial infinities. The good news is that they don’t 
contribute to the T-matrix either, so we can remove them. 4 We are left 
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with the appealing result that 


All connected, amputated Feynman 
LM(2 tt) 4 J (4) £ E diagrams with incoming momentum p\ 

and outgoing momentum pf 

( 20 . 20 ) 

We now need to design an experiment which measures M. and that 
turns out to be a scattering experiment. 


20.4 The scattering cross-section 


When firing one particle at another, we want to think about the ampli¬ 
tude of scattering of our incoming particle in terms of how big the other 
particle appears to be in cross-sectional area terms. Imagine driving at 
night, where your car headlights give you a beam of particles with which 
you probe the inky blackness. A sheep shows up more in your headlights 
than a field mouse, simply because it’s bigger and scatters more photons 
[Fig. 20.7(a)]. Similarly a cow will scatter more light if seen from the 
side than from the front, because it offers a larger cross-sectional area 
in the former case [Fig. 20.7(b)]. However it’s not just about area - 
white sheep are more visible than black sheep but we fold this all in 
to our definition of the scattering cross-section a in which we measure 
the rate R of a scattering process occurring as R = aL, where L is the 
luminosity 5 of our incoming beam of particles. 

Scattering can occur in all directions, and so in each bit of solid angle 
dfi there will be a bit of scattering dcr. A detector usually only covers a 
range of solid angle, so it is usual to think in terms of the differential 
cross-section 6 defined by dcr/dfl. Fermi’s golden rule allows us to 
relate the scattering cross-section to the modulus squared of a matrix 
element. After taking proper account of all the normalization factors 
one can show, 7 for example, that for a scattering process involving two 
equal-mass particles scattering off each other then 


clu _ | AH | 2 

dfi 647 t 2 Eq M ' 


( 20 . 21 ) 


where Eq m is the total energy in the centre of mass frame. 



Fig. 20.7 Scattering cross-sections, 
(a) cTgheep > Afield mouse 1 (k) 
°cow, side ' > °cow, front" 

5 

Luminosity is a quantity with dimen¬ 
sions Time -1 x Area -1 to be evaluated 
for the case under study. 

^ The relationship between a and 
dcr/dQ is given by 

r 27r r 1 da 

Jo d ^/_ 1 d(cOS,9) ^ =CT ' 

This result is discussed in Peskin and 
Schroeder, Chapter 4. 


Example 20.4 

We’re finally going to work out some scattering amplitudes and cross-sections for 
theory. We start with the scattering of two distinguishable psion particles. In 
this case only the ^-channel diagram [Fig. 20.6(a)] contributes. Let’s further simplify 
by considering only a non-relativistic case. This means that we can take the four- 
momentum p = ( E p ,p ) to be p « (m, p). We have 


Pli ss (m, p), p 2 i ~ (to, k), 
Plf ~ (m, pt), p 2 t ~ (to, fe')> 


( 20 . 22 ) 
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Max Born (1882-1970) 


^See Chapter 17. 


so t may be written t = q 2 = (pif — pn) 2 ~ — \p'~p\ 2 — — |<j| 2 , where q = p' — p is 
the three-momentum transfer. The scattering amplitude is given by evaluating the 
t-channel Feynman diagram yielding 

iM = |2 13 ' 2 , ( 20 . 23 ) 

\qr + p 2 

where we ignore the ie, since it’s unnecessary here. Plugging into our expression for 
the cross-section and noting that Eqm = 2m, we obtain 

dcr 1 / g 2 

dn ~ 256t r 2 m 2 \\q\ 2 + p 2 


(20.24) 


We have a measurable prediction from the theory. Interestingly this calculation 
can also be done in non-relativistic quantum mechanics using Born’s approximation, 
which says that the amplitude for scattering from a potential V(r^ is (p'liTlp) = 
—iV(q)(2n)5(E p i - E p ), where V(q) = (p'\V(r)\p) = f d 3 re i(p ~ p '> r V(r). Com- 
paring with our quantum field theory result, we see that Born says that our scattering 
potential is 


V(q) = 


- 9 " 


./2 ' 


\q\ 2 + ^ 

What function has a Fourier transform that looks like this? The answer is 

y « = -4$ r 


q -mM . 


(20.25) 


(20.26) 


the potential dies away over a distance l//r. But this is just Yukawa’s potential! 8 
This is the reason why the if>< ip(f> theory is often called Yukawa theory. 


0 Readers interested in fermions, pho¬ 
tons and quantum electrodynamics 
may now proceed to Chapter 36. How¬ 
ever, the main route of the book will be 
different. 


We now have enough formalism to start calculating amplitudes relevant 
in real systems. 9 However, in the next chapter we will take a break 
from quantum mechanics and discuss the seemingly unrelated subject 
of statistical physics. This subject has an unexpected connection to 
quantum field theory that will provide some powerful new insights. 


Chapter summary 

• Yukawa’s theory involves an interaction term Tii = ^(j). 

• We have introduced two new diagrams: tadpole and oyster. 

• The scattering matrix and cross-section can be related to the in¬ 
variant amplitude M. The only contribution to Ai comes from 
connected, amputated Feynman diagrams. 


Exercises 


( 20 . 1 ) 

( 20 . 2 ) 


Verify the rules for the contractions in eqn 20.6. 

(a) Verify that the Fourier transform of V ( r ) = 


47T |r | 




is V(q) = 


HF+m 5 


(b) By taking an appropriate limit, find the form of 
the Fourier transform of the Coulomb potential. 




Part V 


Interlude: wisdom from 
statistical physics 

Statistical physics is the study of large assemblies of atoms or particles 
and allows one to extract thermodynamic quantities through a process 
of averaging. Typically you write down the partition function Z for 
the system and manipulate it to generate the quantities you want. In 
quantum field theory there is an analogous process which involves a 
generating function Z[J\ which can be processed to bring forth Green’s 
functions. 

• A rapid crash course in basic statistical physics is given in Chap¬ 
ter 21, showing how the partition function can be used to manufac¬ 
ture any desired thermodynamic quantity or correlation function. 

• Chapter 22 develops these ideas for quantum field theory, intro¬ 
ducing the generating functional Z[J] and linking this with the 
S'-matrix via the Gell-Mann-Low theorem and showing how these 
concepts are connected with diagrams. 
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Statistical physics: a crash 
course 


21.1 Statistical mechanics 
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Relax, you know more than you think you do. 

Benjamin Spock (1903-1998), Baby and Child Care 

21.1 Statistical mechanics in a nutshell 

Consider a system with a Hamiltonian Hq that has energy eigenvalues 
E\. The probability of the system being in some particular state |A) 
with energy E\ at a temperature T is given by the Gibbs distribution 


Ex 

PA = ^ r , (21.1) 


where j3 = 1/k^T and Z is known as the partition function which is 
defined as a sum over states Z = e~ >3Ex , or equivalently 


Z = ^(A|e“^°|A) = Tr 

A 



( 21 . 2 ) 


t H H H H t - 

i=l 234567 89 10 

Fig. 21.1 A simple model of a one¬ 
dimensional magnet. 

lr This, of course, is shorthand. We 
mean that the spin states are eigen¬ 
states of the S z operator, which has 
eigenvalues S z = d=^. 


The term Z is needed to guarantee that XApa = 1, which is a require¬ 
ment of any probability distribution. However, the partition function 
Z is more important than a mere normalization constant. It turns out 
that all of the information about the system is contained in Z and for 
this reason Z is also known as a generating function. 

Consider a one-dimensional magnet which is made up of a lattice of 
N sites, each one labelled by an index i, decorated with spins (shown in 
Fig. 21.1). The spins have magnitude S = A and can point up (S z = +|) 
or down ( S z = ^l). 1 We define the operator-valued field fa. This is the 
same sort of field as we had before: we input a lattice point position i 
and get out an operator which we use to act on a state of the system. 
In this case the operator is S z i which we understand to act only on the 
spin in the ith position. 


Example 21.1 

A state-vector | A) lists the values of each of the spins in our one-dimensional magnet, 
e.g. |A) = | tti-i-li- •••)• We input a position (e.g. i — 4) and obtain an operator 
corresponding to that position <j> 4 . Acting on the state |A) with </>4 yields 

M mm •••> = mill---), (21.3) 




21.2 
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since the spin at position i = 4 has eigenvalue S z 4 = — |. 


Acting on |A) with <f>i yields S^|A), where S^ is the eigenvalue of 
the operator S z applied to the ith spin for the state |A). As usual, 
the expectation value of the fa operator for the state |A) is (A|</>j|A). 
At a temperature T we might want to know the thermal expectation 
value of the ith spin, denoted (fa)t which is found by summing over 
the expectation values weighted by the probabilities of finding a state of 
energy E\ thus: 


(<A)t = 

X 


^Y' S « e ~ flEx 

X 

X 

|5>|^e-^°|A). 

A 

Tr ^e"^ 0 
Z ' 


(21.4) 


The quantity p = e~^ H ° /Z is known as the probability density operator. 
Its matrix elements are known collectively as the density matrix. We 
conclude that to find the thermal expectation value of an operator in 
statistical physics we need to multiply it by the density matrix and then 
take a trace over all of the states. That’s pretty much all there is to 
statistical physics. 


21.2 Sources in statistical physics 

There’s a neater way of getting the thermal average of the field (fa ) t . 
It involves adding to the Hamiltonian Hq a fictional term H s = 
— (7 XA Jk4>k- Such a term, which involves coupling a field Jk to the 
spin at position k, is known as a source term for reasons which will 
come apparent later. 2 With the inclusion of the source term, the parti¬ 
tion function is now 


Z{J) = Tr 



= Tr 


e ~0Ho+J2k J k4>k 


(21.5) 


The point of this procedure is that we can now use the partition function 
Z(J) to work out the thermal average (4>i)t by differentiating Z(J) with 
respect to Jj, and evaluating this at Jj = 0 and dividing by Z(J = 0). 
That is to say 


(<A)t = 


1 


dZ(J) 


Z{J = 0) dJi 


Tr 


tAe 


-13 H 0 


Ji- 0 


Z(J = 0) 


( 21 . 6 ) 


as we had before. 


In the absence of a magnetic field, a 
system of non-interacting spins at some 
particular temperature is expected to 
have t = 0 for all i. This reflects 
the fact that spin i has equal proba¬ 
bility to be found with S z = 1/2 or 
— 1/2. If however a system shows spin 
order (more usually called magnetic or¬ 
der) then we would expect 7 ^ 0 for 
all i. 


2 

The source term for a magnet is actu¬ 
ally proportional to the magnetic field 
Bi , i.e. the value of the 5-field evalu¬ 
ated at position i. 
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But that’s not all. We can also use this philosophy to calculate cor¬ 
relation functions. A useful question in statistical physics is to ask 
about paired averages. For example, if we know the value of the spin at 
position i, we’d like to know the probability that the spin at position j is 
in the same state. The quantity we’re after is Gy = (fafa) t- This tells 
us about the degree of correlation between spins in different parts of the 
system. If the spins are completely random, then Gy = 0, whereas if 
they are completely aligned then Gy = | (because the spin eigenvalues 
are ±f). More interesting is when the correlation is partial and then 
we might expect to find Gy = \ when \i — j\ is small but Gy —> 0 as 
— j\ —> oo. Sometimes it is interesting to look for deviations from 
order, which is the situation when (fa) t ^ 0. In that case, one can 
examine the connected correlation function 

G C y = (fafa), - (fa)t(fa)t■ (21.7) 


Our notation makes these correlation functions look rather like cousins 
of the Green’s functions that we’ve been considering previously. Some 
example correlations we can find include 


Gy - (fafa)t - 


1 d 2 Z(J) 
Y( 0) dfadJj 


J—0 


Gijk — (fafa$k)t — 

and in general 

(fal ■■■fan) t = 


1 d 3 Z(J) 


Z( 0) dfadJjdJk 

1 d n Z(J) 


J—0 


Z( 0) dJ H ...dJ in 


( 21 . 8 ) 


(21.9) 


J—0 


The moral of this story is that by linearly coupling an operator-valued 
field fa to a source J, our partition function becomes a generating func¬ 
tion. Once we’ve found the generating function we gain access to the 
average value of fa and a vast set of related quantities. 


21.3 A look ahead 

There is a close analogy between statistical physics, as exemplified by 
the magnet, and quantum field theory. In fact everything we’re looked 
at in this chapter has an analogue in field theory. In going to field theory 
we’ll take a continuum limit, turning the lattice label i into the position 
x so that fa —> fax). The correlation functions of statistical physics 
become propagators in field theory and our goal is to find these. 

3 See Chapter 25. This is all very nice and certainly deserves further examination, 3 but 

the point of this chapter is to tell you that to reach the goal of finding 
propagators for a field theory we can use a generating functional Z[J] 
similar to that used in statistical physics. The generating functional for 
fields will allow us, through (functional) differentiation, to find all of the 
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Statistical physics Field theory 

(on a lattice) (continuum) 


Source 


Ji 


Generating function 
Green’s functions 
Differentiation recipe 
Order 


Z{J) 

= (4-4) t 

r (n) _ 1 d n Z(J) 

^*1 -in - Z(J= 0) dJ H ...dJ in 

{4>i)t 7^ 0 


J=0 


J(x) 

Z[J(x )] 

G (n) (2h, •••,£«) 

G (n) (a;i, ...,ar n ) 

(il|^(a:)|n) ^ 0 


(fi|T^(xi)...^(x n )|n) 

1 1 _ 6 n Z[J] 

i n Z[J=0] <5 J(xi)...SJ(x n ) 


J =0 


Table 21.1 Analogies between quantities in statistical physics on a lattice and those 
in quantum field theory. 


propagators. In equations, this can be written as 


G{x 1 , ...,£„) 


(n\T4>(x 1 )...(j)(x n )\n} 

1 1 5 n Z[J] 

i" Z[J = 0] 5J(x\)...5J{x n ) 


j -o 


( 21 . 10 ) 


One of the marvellous things about field theory is that even when 
we can’t solve a theory exactly, the generating functional Z[J\ and the 
Green’s functions G ^ (xi ,..., x n ) can be expressed in terms of Feynman 
diagrams. We will examine this in the next chapter. 


Chapter summary 

• Correlation functions in statistical mechanics can be extracted by 
differentiating the partition function Z with respect to the source 
term J. This approach carries over into quantum field theory be¬ 
cause Green’s functions can be extracted from a generating func¬ 
tional Z[J], Some analogies between statistical mechanics and 
quantum field theory are tabulated in Table 21.1. 


Exercises 

(21.1) Define the density operator as p = Us- of the number of excitations is given by 

ing the rule that thermal averages are given by , 

(A)t = Tr jyhij, show that for a quantum oscillator, (n )t = (a*a) = --. 

with Hamiltonian H = c oa'a, the thermal average 


( 21 . 11 ) 
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(21.2) Response functions or susceptibilities are ubiquitous 
in physics. Consider the forced quantum oscillator 

L = i mx(t ) 2 — ^muj 2 x(t) 2 + f(t)x(t), (21.12) 

and take the definition of the response function to 
be 

/ OO 

d t'x(t-t')f(t'). (21.13) 

-OO 


(e) Compare the behaviour of the response function 
with that of the correlation function S defined as 

S = \{{xi{t'),xi{t)}) = ^( xi{t')xi{t) + xi{t)xi{t')). 

(21.16) 

(21.3) Diffusion problems are common in thermodynam¬ 
ics. Here we meet the diffusion equation. The dif¬ 
fusion equation for the number density of particles 
n(x, t ) is 


(a) Using the interaction representation and treat¬ 
ing H' = —f(t)x(t) as the interaction part, show 
that, to first order in the force function fi(t), we 
have 

\ipi(t)) = \0) + i f dt'fi(t')xi(t')\0). (21.14) 

J — OO 

(b) Using the previous result, show that 

X(t - t') = i0(t - t')( 0| [xi(t),xi(t')] |0). (21.15) 

(c) Using the fact that xi(t) = 2 (ae -1 " 4 + 

a'e lult ), calculate the response function for the 
quantum oscillator at temperature T = 0. 

(d) Using the fact that at nonzero T we have 
(fi)t = (e^ 3 " — l) -1 , calculate the response func¬ 
tion for the quantum oscillator at nonzero T. 




The generating functional 
for fields 


22 


Since the finished product cannot be served to the table as it 
is, this isn’t so much a recipe, a method or a list of ingredi¬ 
ents: it is a way of life, for if there is fresh stock made every 
week, a reputation as a good cook will follow. 

Len Deighton (1929- ), Action Cook Book 

In this chapter we will present a general method of finding Green’s func¬ 
tions based on the philosophy of generating functionals. We will explain 
the meaning of the generating functional of quantum field theory, Z[J\, 
and show that all Green’s functions may be extracted from it though 
differentiation. This will allow us to relate together the S'-matrix and 
Green’s functions, linking two of the most important topics in quantum 
held theory. 

Although this chapter contains a lot of formal material the ideas are 
quite simple (and have much in common with the world of statistical 
physics). In fact the rules that emerge for perturbation theory allow us to 
write generating functionals and Green’s functions as sums of Feynman 
diagrams, relating formal objects such as G(x,y) and Z[J] to a series of 
cartoons. 


22.1 How to find Green’s func¬ 
tions 201 

22.2 Linking things up with the 
Gell-Mann—Low theorem 203 

22.3 How to calculate Green’s 
functions with diagrams 204 

22.4 More facts about diagrams 

206 

Chapter summary 207 
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22.1 How to find Green’s functions 

All of the information about a held theory is available in the form of 
Green’s functions. Happily there’s an easy way to work out Green’s 
functions which is based on a trick which enables us to lump all of the 
information about a quantum held theory in a single entity called the 
generating functional or Z[J], The generating functional is replete 
with all that juicy data in a zipped up and compressed form and there’s 
a simple procedure for extracting it. This is identical to the case of 
statistical physics, where all correlation functions were obtainable from 
the partition function Z(J). 

Just as in statistical physics we make use of the philosophy of sources. 

In order to make a generating functional we’ll add a source function J(a?) 

to our Lagrangian density, which couples linearly 1 with the held </>{x): 1 Note that by adding J<fr to £, we will 

end up subtractinq Jcb from 7 ~C. 

C[<j>(x)\ —> £[(j>(x)\ + J(x)<j>(x). (22.1) 

The reason why J(x) is referred to as a source is that in quantum held 
theory this term will generate particle excitations in the held <fi(x). By 
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turning on a source we are, essentially, grabbing the field and shaking 
excitations out. 

We now define the generating functional for quantum fields as 

Z[J] = (n\U(oo,-oo)\n)j, (22.2) 


where U is the time-evolution operator for the full Hamiltonian H of the 
theory and the J subscript on the right-hand side reads ‘in the presence 
of a source J’ when said out loud. We also have that H\fl) = 0. That 
is Q) is the physical ground state of the system and the ground state 
energy is defined to be zero. Turning eqn 22.2 into words we see that 
the generating functional Z[J] tells us the amplitude 


Z[J\ = 


no particles 
at x° = oo 


no particles 
at y° = —oo 


(22.3) 


that is, in the presence of a source, we start with no particles and end 
with no particles. We could call Z[J\ the ‘no particle propagator’. 

As we have defined it, Z[J\ allows two classes of process apart from the 
rather trivial option of nothing happening at all: (i) those where a par¬ 
ticle is created by the source J and later absorbed by J; (ii) those where 
a particle spontaneously appears and then disappears. The latter class, 
corresponding to Z[J = 0], is described by vacuum diagrams and needs 
to be removed by normalization in order for the generating functional 
to work. We therefore define a normalized generating functional: 


Z[J\ 


Z[J} 

Z[J = 0]' 


(22.4) 


This guarantees that Z[J = 0] = 1 or, in other words, that the amplitude 
to start and end with no particles in the absence of sources is unity. 

We now need to find a way of calculating Z\J\. Writing this in terms 
of the time-evolution operator as Z[J] = (fi|f 7 (oo, —oo)|fl)j turns this 
into a problem of finding U{ oo,—oo). A clue is provided by the way 
in which we worked out the ^-matrix in the last few chapters. In the 
interaction picture, the time-evolution operator U\{oo, — oo) is known as 
the ^-operator and the Dyson expansion gives us a way to calculate 
S via S = Te _1 -f d xHl . Things aren’t so simple here though, since 
U( oo,—oo) in eqn 22.2 is defined in the Heisenberg picture. The prob¬ 
lem is that we don’t have free fields in the Heisenberg picture so we 
can’t use Wick’s theorem to grind down long strings of operators (since 
Wick’s theorem only works on free fields). However we can still write 
a Dyson-like equation for the time evolution in terms of the Heisenberg 
fields 0 h- If we treat the source term like we would an interaction by 
saying Hi = —J(f> h, then the time-evolution operator is given by Dyson’s 
expression U( oo,—oo) = Te~'f d x ' Hl . Splitting this up in this way will 
only generate processes involving the sources and not the vacuum dia¬ 
grams, and so will result in the normalized generating functional. We 
can therefore write 


Z[J\ = (fl |7V -T d4x ( x ) | O), 


(22.5) 
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where the fields 4>u(x) evolve according to the full Hamiltonian (rather 
than a free Hamiltonian Ho used in the interaction picture) and we use 
the interacting ground state |fl) rather than the free ground state |0). 
Again we stress this is all still permissible, as long as we remember that 
we wouldn’t be able to use Wick’s theorem. We expand Z[J] to obtain 


OO • yi p 

Z[J\ = 1 + “T / d 4 xi...d 4 a; n J(xi)...J(x n ){Cl\T^u{xi)...^u(xn)\^)- 

n= 1 

( 22 . 6 ) 

Recalling that Green’s functions are defined by 

G^( Xl ,...,x n ) = (n\TM Xl )...Mx n )\n), ( 22 . 7 ) 


we notice that we can get access to them through a manipulation of the 
generating function Z[J] by differentiating eqn 22.6 as follows: 


G (n \ xi, -,x n ) 


1_ S n Z[J} 

i" 5J{xi)...8J{x n ) 


j=o 


( 22 . 8 ) 


That is, we get the Green’s functions (which contain all of the informa¬ 
tion about a system) by differentiating a single functional Z[J\. This 
method of doing quantum field theory is illustrated in Fig. 22.1. In terms 
of the unnormalized generating functional Z[J\ we have the equivalent 
rule 


G {n) {x i, ...,£„) 


1 1 S n Z[J] 

i n Z[J = 0] 5J{x\)...8J(x n ) 


j =o 


(22.9) 


which closely resembles the version from statistical physics. 

We’ve done what we set out to do: we have found a single functional 
that contains all of the information about the system, which can be used 
to find Green’s functions. All we need to do now is find an automated 
way of calculating Z[J] for any theory. It turns out that there are two 
ways: the first is by relating Z[J] to the S-matrix; the second uses 
Feynman’s path integral. In this chapter we pursue the first approach, 
leaving the second for later (Chapter 24). 


22.2 Linking things up with the 
Gell-Mann—Low theorem 

We’ll examine the first method: calculating Z[J\ from the S-matrix. We 
stress that the S'-matrix is defined for the interaction picture, so we’ll 
need an equation for Z[J] given explicitly in terms of interaction picture 
fields. We’ll pluck the answer from the air and then prove we’ve plucked 
correctly. We propose a form for the normalized generating function 
Z[J] which looks like: 



Fig. 22.1 Doing quantum field theory 
using generating functionals 


Z[J\ 


z\J] 

m 


(0|re _ 1 d4 ^fWi—^(:c)<?i(^)] |o) 
(0|Te -i / d4x7 A|0) 


( 22 . 10 ) 
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which looks like a good guess because it has the form of Z[J] in eqn 22.5, 
but this time writing the interaction Hamiltonian in the form — Jcj) j. 
This isn’t quite in terms of an expression for the S-matrix yet as we have 
the source term in the numerator. Using the rule that you differentiate 
n times, divide by i n and set J = 0 to yield a Green’s functions, we have 
the Green’s function 


G^ n \x i, ...,x n ) = 


(0\TMx 1 )...Mx n )e~ i f<i 4 ^i\0) 


(0|Te- i / d4xWl |0) 
and this can be written in terms of the ^-operator as 


G (n) (x i,... ,x n ) = 


(0|T^i(a;i)...^i(a;„)5|0) 


( 22 . 11 ) 


( 22 . 12 ) 


<0|S|0) 

All that’s left is now is to prove that we’ve made a correct guess, so that 




(0\TMx 1 )...Mx n )S\0) 

<0|A|0) 


(22.13) 


2 

The proof of the theorem can be found 
in the book by P. Coleman. It was 
first proved by Gell-Mann and Low in 
1951. Murray Gell-Mann (1929- ) re¬ 
ceived the 1969 Nobel Prize in Physics 
and, borrowing a term from Finnegans 
Wake , gave the name to ‘quarks’. Fran¬ 
cis Low (1921-2007) worked with Gell- 
Mann at Princeton, and spent most of 
his later career at MIT. 


This result turns out to be true and is known as the Gell-Mann—Low 
theorem. 2 Let’s be completely clear about what eqn 22.13 is saying. 
First examine the left-hand side: 

• |fl) is the interacting ground state, defined by the full Hamiltonian 
as H\fl) = 0. (j)u(x ) are Heisenberg field operators. They time- 
evolve according to the full Hamiltonian H. 

Next consider the right-hand side: 

• Since we’ve turned off the source term now, the Hamiltonian can 
be split up into free (solvable) and interacting (usually unsolvable) 
parts H = H 0 + H'. The right-hand side is therefore written 
in terms of the interaction picture. |0) is the free ground state, 
defined such that i?o|0) = 0. Quantities (j>i(x) are interaction 
picture held operators. These time-evolve according to the free 
part of the Hamiltonian Hq. 


22.3 How to calculate Green’s functions 
with diagrams 

We’ve seen how to get a Green’s functions from the generating func¬ 
tional of a theory. It’s simply a matter of differentiating. It’s natural to 
be curious about what happens if we try to directly relate the Green’s 
functions to Feynman diagrams. The Gell-Mann-Low theorem is a quo¬ 
tient: 

G (n)( \ = (0\TMxi)-Mxn)S\0) 

(0|S|0) 


(22.14) 
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Presumably there might be some cancellation between the Feynman di¬ 
agrams thrown up in the numerator against those coming from the de¬ 
nominator. It turns out that this cancellation does indeed take place 
and leaves us with a wonderfully simple result: 

o'” - ) ■ 

(22.15) 

Why is this? If we start with the Gell-Mann-Low formula (eqn 22.14), 
then the numerator factorizes into 


Recall from Section 19.3 (see particu¬ 
larly Fig. 19.4) that a connected dia¬ 
gram is made up of a single piece of 
linked lines and loops. A disconnected 
diagram is composed of a number (> 1) 
of separate pieces. 


<O|T0i(* 1 )...0i(a: n )^|O) = ^ 


Connected diagrams 
with n external lines 



Connected 
vacuum diagrams 


and the denominator is simply (0|<S I |0) = exp 
so we are able to show that 


E 


Connected 
vacuum diagrams 


G (n) ( Xl ,...,x n ) = (n\TMxi)...Mx n )\Q) 

Connected diagrams 
with n external lines 


E 



and that proves eqn 22.15, the most important result of this chapter. 


Example 22.1 

Let’s see how this unfolds for the single-particle propagator G^ 2 \x 1 , 0 : 2 ). A typical 
term will result in a disconnected diagram made up of a connected diagram with 
external lines and several connected vacuum diagrams. The value of the full diagram 
is the product of all of the connected diagrams. We label each connected vacuum 
diagram by some number i. There may be more than one copy of each connected 
vacuum diagram: we’ll call the number of copies rii. Call the value of a connected 
vacuum diagram Vi. Put all of this together and write the value of a full, disconnected 
diagram, representing one particular contraction, as 


{ Connected diagram \ -i-r 1 
y with external lines J f .-1 - 1 


(22.16) 


where we need a factor because we can interchange the rii copies of the connected 
diagram i. The sum of all diagrams is then given by 


E E 

Connected diagrams ( a ll f n i}) 
with external lines 


Connected diagram 
with external lines 



(22.17) 


where, to generate all possible disconnected diagrams, we have summed over all 
possible connected diagrams with external lines and over all ordered sets of integers 
{rii). To break this down we may factorize out the sum over connected diagrams 
with external lines: 



Connected diagram 
with external lines 


e 


(22.18) 
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Then the rest of the expression may be factorized thus: 


E 

E 

E 

E 


Connected diagram 
with external lines 

Connected diagram 
with external lines 

Connected diagram 
with external lines 



Connected diagram 
with external lines 




(22.19) 


Now, as long as we know the Feynman rules for a theory, we can for¬ 
get about the 5-matrix and the interaction picture. We’ve reduced the 
whole perturbation theory to one expression. To get the n-point Green’s 
function we just sum all Feynman diagrams with n external lines. Amaz¬ 
ingly simple. 


22.4 More facts about diagrams 

We’ve seen that G^ 2 ^(a;i ,£ 2 ), which is the amplitude for a particle to 
start at £2 and propagate to X\ . may be written as a sum of diagrams 
describing all the ways in which a particle can start at £2 and finish up at 
X\. We ask next, is there a way of expressing Z[J\ in terms of diagrams? 
Since Z[J] describes a situation where we have sources (and sinks) of 
particles, but that we start at t = —00 with no particles and end up at 
t = 00 with no particles, we might expect to be able to express Z[J] 
as a sum of diagrams describing ways of starting and ending without 
particles. This is indeed the case and we have 


Z{J] = (fi(oo)|fl(—oo))j = (0|Sj0)j 

E ( Disconnected vacuum and 
l source-to-source diagrams 


( 22 . 20 ) 


The sum here is made up of all of the ways of starting and ending with no 
particles, which corresponds to the sum of all disconnected vacuum and 
source-to-source diagrams, of which there are potentially a very great 
number. Fortunately there’s an even simpler way of stating this result, 
which is: 


'"'The linked-cluster theorem, which 
equates the right-hand sides of 
eqns 22.20 and 22.21, can be stated 
more memorably as 


E 


all 

diagrams 


E 


connected 

diagrams 


£ 


Z[J\ = e 


Connected vacuum and 
source-to-source diagrams 


( 22 . 21 ) 


This simplification is based on the linked-cluster theorem, 3 which 
we now show follows from more combinatoric arguments, very similar 
to those used earlier. We call a vacuum or source-to-source diagram 
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Vi- Now, in general, the amplitude for a general, possibly disconnected 
diagram is given by ‘ and the generating functional is given by 

the sum 

z =nE^=« E -‘''' < 22 - 22 > 


i rii=0 


rij\ 


proving the theorem. (Notice that all of the leg work was done in Ex¬ 
ample 22.1!) 


Example 22.2 


We can use the linked-cluster theorem to simplify the expression for the normalized 
generating functional Z[J ] = Z[J]/Z[ 0]. If we set J = 0, we turn off the sources and 
we have 


Z[Q\ = (Connected vacuum diagrams) 


(22.23) 


We write 


Z[J\ 



Connected source- 
to-source diagrams 


) (Connected vacuum diagrams) 


(22.24) 


The normalized generating functional is then 

/ Connected source- 
ZU] _ z[j] _ M to-source diagrams 
1 1 ~~ Z[ 0] " 

allowing us to give more meaning to the normalized functional Z[J], 


(22.25) 


Notice that we derive most of the results on the way so that we never have 
to use them. (One assumes this is a bit like learning knife fighting.) The 
end result for a particular theory is that the propagator is represented 
by a sum of connected diagrams. Of course, the details of what the 
diagams look like come from a one-off, nitty-gritty calculation. We’ll see 
a few more examples of these in the following chapters. Finally we’ll 
need to do the integrals. This will lead into even stranger territory. 


Chapter summary 

• Green’s functions are derived from generating functionals. This 
approach shows that G 1 '" 1 is simply the sum of all diagrams with 
n external lines. 

• The generating functional Z[J] is the exponential of the sum of all 
connected source-to-source and vacuum diagrams. 
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Exercises 


(22.1) Consider the forced quantum oscillator (22.2) 

L = ^mi(t) 2 — ^mu> 2 x(t) 2 + f(t)x(t), (22.26) 

where f(t) is a constant force fo which acts only for 
a finite length of time: 0 < t < T. We can use the 
5-matrix to work out the generating functional for 
this theory, by treating f(t)x(t) as an interaction 
term. 

(a) Explain why we may write (0|5|0) = Z[f(t)] for 
this theory. 

(b) By expanding the S-matrix and show that 
Z[f(t )] = e ( Dumbbe11 ), where (Dumbbell) is the 
Feynman diagram shown in Fig. 22.2(a) whose am¬ 
plitude is 

(Dumbbell) = ^ J dtdt'f(t){0\Tx(t)x(t')\0)f(t'). 

(22.27) 

(c) Working in frequency space, show that the 
Dumbbell may be written 

(Dumbbell) = -±- J ^/(- ^ 2 _^ 2+ie />), 

(22.28) 

where /(^) = ^ (l — e lvT ). 

(d) What does the quantity |Z[/(f)]| 2 describe 
physically? Using the identity * ie = P —inS(E), 
where V is the principal part (see Appendix B), 
show that this quantity is given by 

|Z[/(t)]| 2 =exp(-^sin 2 ^. (22.29) 



Fig. 22.2 (a) Dumbbell diagram for Exercise 22.1. (b) 
Dumbbell diagram for Exercise 22.2. 


Now consider scalar field theory in the presence of 
a source field 

£ = - r ^- ( t>{x) 2 +gJ{x)<j>(x). (22.30) 


Notice the resemblance between this theory and the 
forced oscillator. 

(a) Show that the amplitude for the Dumbbell dia¬ 
gram in this theory [Fig. 22.2(b)] is given by 


(-i3) 2 


Jd 4 xd 4 yJ 


d4p J(x) 16 


-ip-(x-y) 


(2 


p 2 — m 2 + ie 


J(y)- 


(22.31) 

(b) Consider a source field describing two static 
sources localized at x\ and X2- 


J{x) = S (3> (x — xi) + S^ 3 \x — X 2 ). (22.32) 


Show that the interaction between the two sources 
can be written 


(Dumbbell) 12 oc —i g 2 



-y°) 


x 


d 3 p 

(2-k) 3 p 2 — m 2 + ie ’ 


where the self-interaction of the sources is ignored, 
(c) Show further that 


r r u3 ip-(xi —*2) 

(Dumbbell)., oc ig 2 / da; 0 / —— — - -—. 

V 12 J J (2tt) 3 p 2 + m 2 

(22.33) 

(d) Finally, using the fact that e~ lET = (0|5|0) = 
e (Dumbbell), w i lere y j s the time over which the 
sources act (see Exercise 43.5 if in doubt), show 
that the interaction energy E between two spatially 
separate sources is given by 


E oc -g 2 


/ 


d 3 p e ip (a,1_a,2) 
(2-7 t) 3 p 2 + m 2 


(22.34) 


The important feature here is that the energy is neg¬ 
ative. The two sources therefore attract each other 
by exchanging a virtual particle of mass m. 

*(e) Compare the previous result to the correspond¬ 
ing one for a (spin-1) vector field dealt with in 
Chapter 24. Is the interaction attractive or repul¬ 
sive? A graviton is a spin-2 object. What would 
you expect for the interaction of sources in a gravi¬ 
tational field. See Feynman’s Lectures on Gravita¬ 
tion, Chapter 3, for a discussion. 



Part VI 


Path integrals 

In this part we use a reformulation of quantum field theory which is due 
to Richard Feynman, the so-called path integral approach. Quantities 
of interest in quantum field theory, such as the propagator, can be cal¬ 
culated using a functional integral, which is an integral over all possible 
paths of the particle from one spacetime point to another. 

• We describe Feynman’s path integral approach in Chapter 23 and 
demonstrate how to evaluate certain Gaussian integrals, using the 
simple harmonic oscillator as an example. 

• These ideas are then applied in Chapter 24 to working out func¬ 
tional integrals and thus calculating Green’s functions. 

• There is a rather subtle connection between quantum field theory 
and statistical physics, and this is explored in Chapter 25, where 
the concepts of imaginary time and the Wick rotation are intro¬ 
duced. 

• An important concept is symmetry breaking and this is treated 
in Chapter 26. This idea is introduced in the context of phase 
transitions and we show how to describe symmetry breaking with 
a Lagrangian. Breaking a continuous symmetry leads to Goldstone 
modes and we also show the consequence of symmetry breaking in 
a gauge theory, thereby introducing the Higgs mechanism. 

• Chapter 27 describes coherent states which are eigenstates of the 
annihilation operator. These states do not have a definite number 
of particles but they can be made to have a well-defined phase and 
are used to describe laser fields and superfluids. 

• We introduce a new type of mathematical object in Chapter 28: 
Grassmann numbers. These anticommute and can be used to de¬ 
scribe fermions. We show how to construct a fermion path integral 
using Grassmann numbers. 
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Thirty-one years ago Dick Feynman told me about his ‘sum 
over histories’ version of quantum mechanics. ‘The electron 
does anything it likes’, he said. ‘It goes in any direction at 
any speed, forward and backward in time, however it likes, 
and then you add up the amplitudes and it gives you the 
wave-function.’ I said to him, ‘You’re crazy’. But he wasn’t. 
F. J. Dyson (1923- ) 


In this chapter, we introduce the path integral formulation of quantum 
mechanics. 


23.1 How to do quantum mechanics using 
path integrals 



Fig. 23.1 Some possible paths with 
their amplitudes. A particle will take 
all of them. 


Quantum mechanics is all about probability amplitudes. One way to do 
quantum mechanics is to ask a question like ‘what’s the amplitude that 
a particle starts at point q a at time t a and ends up at point q\, at time 
t\,V The amplitude for the process is, as we’ve discussed, known as the 
propagator. Previously, we found the propagator using a conventional 
approach to quantum mechanics espoused by Schrodinger and Heisen¬ 
berg. In this chapter we’ll find the propagator using Richard Feynman’s 
alternative approach. This will then be applied to quantum fields. 

The propagator depends on the trajectory that, for whatever reason, 
the particle takes in getting from A to B. Some possible paths are shown 
in Fig. 23.1. In classical mechanics, we’d know what to do: write down a 
Lagrangian L; plug that Lagrangian into the Euler-Lagrange equation 
and thus generate the equations of motion, which we’d then (at least 
try) to solve. The trajectory that we’d calculate would be the one that 
minimizes the action f 

S = I d tL[q(t)], (23.1) 

Jt A 

just as we discussed in Chapter 1. 

How does this work in quantum mechanics? Richard Feynman’s sug¬ 
gestion was that in getting from A to B a particle will take every pos¬ 
sible trajectory. We’ll say that again. A particle will take every single 
possible trajectory, forward and backward in time, zig-zagging, loop¬ 
ing, whatever. To get the quantum amplitude Feynman says that each 
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trajectory contributes a complex factor e lS / R , where S is the action de¬ 
scribing that trajectory, and we add up the contributions to get the 
amplitude. Crazy! 

Let’s see if we can justify this approach of getting the amplitude 
by summing over e lS ^ h factors associated with each trajectory. This 
is also helpfully known as the sum over histories approach because 
the present wave function is obtained by summing over all possible past 
trajectories. The propagator amplitude G to get from spacetime point 
A [= (t a ,<7a)] to point B [= (t b ,9b)] may be written as (q h \U(t h , t a )\q a ), 
where | q) is an eigenstate of position and U(tb,t a ) is the time-evolution 
operator U (i b , t a ) = (where we’ve returned to our usual units 

employing h = 1). We have then 

G=(q h \e- i ^- t ^\q a ). (23.2) 

To deal with this we split the trajectory into N infinitesimal steps of 
length At as shown in Fig. 23.2. This is known as time-slicing. The 
fact that U(t) is a unitary operator allows us to time-slice. Unitarity 
lets us say U(tb — t a ) = U(tb~ t x )U{t x — t a ) and so time-slicing U(t) = 

e -i H(t b -t a ) j n thjg wa y yi e i c i s 

G = (q h \(e- i&M ) N \<l,) 

= (q h \e-^...e- i » At ...e-^\q a ). (23.3) 

Notice that each of these mini time-evolution operators moves the par¬ 
ticle for time At between two closely spaced positions q n and q n + i (say) 
as shown in Fig. 23.2. 


t 



Fig. 23.2 Time slicing a trajectory 
from (t a ,g a ) to (tb, 9 b)- 


Example 23.1 

Remember that in quantum mechanics we often make use of the completeness of a 
set of states. For the position states q n , we have f dq n \q n )(q n \ = 1. The trick to 
get any further with G is to insert a ‘resolution of the identity’ (also known as a fat 
unity) in between each mini time-evolution operator. We’ll need N — 1 fat unities to 
do this. Inserting these in (working from right to left) we obtain 


G = 


(%|e lHAt j dqjv— i l<ZiV— l) (<?JV— 1 
...e~ i6At 
...e~ i6At \ [ 


g —i-STAt _ 


J d<J„ + i|g„+i)(<2n+l| e lHAt J 


c\n„ 


f , , W , 


Notice what’s happening here. In each resolution we’re varying the g n ’s indepen¬ 
dently and making new trajectories as we do so. In Fig. 23.2 this corresponds to 
moving the blobs along each line independently to some new position to make a new 
trajectory. We take a snapshot of the trajectory and then move the blobs again. The 
path integral is done by adding up all of the different trajectories that you make. In 
this way we add all of the possible paths between q a and q 5 . A little rearranging 
reveals that we have 

G = J dgid® ■ ■ -dgjv.i (9b|e _lWAt |9jv-l> • • • (q n +l\e~ lHAt \q n ) ■ ■ ■ (qi\e~ lHAt \q„). 

(23.5) 
















212 Path integrals: I said to him, ‘You’re crazy’ 


1 There’s some sleight of hand 
here. What we’re saying is that 
e -i(T+V)At _ e -iTAt e -iVAt + 

0(At 2 ), and since At is assumed 
small, all is well. In general if two 
operators have a commutation relation 
[A,B] = C then e A +z = e A e B e -C/2 
Here we take A = —iTAt, B = —iV At 
and so C = — (At) 2 [T, V”] and thus 
e~ C / 2 « l + 0[(At) 2 ]. 


o 

The result we 
f°° dxe~^- +bx 

J — OO 

a = iA t/m and b = 


need is 



i((?n+l Qn)- 


that 

with 


Between the two amplitudes at the end we have a string of mini-propagators G n = 
(q n +i\e~ iHAt \qn)- Let’s look at one of the typical mini-propagators: 


G n 


= ( 9 n+l|e lHAt \qn) 


= ( 9 n+l|e 


-i A * 


l<Zn), 


(23.6) 


where we’ve written H = p 2 /2m + V(q). To evaluate the mini-propagator we need 
to replace the operators by eigenvalues. The potential term acts 1 on the position 
state as follows: e~^^ At |g n ) = \q n )e~ lV ^ qn ^ At [we’ve moved the state | q n ) past 
the potential operator V(q), replacing the operator with its eigenvalue V(q n )\. We 
have then 


G n = (q n + i|e l 2 m At \q n )e 


iV(q n )At 


(23.7) 


Next we need to know how p acts on a position state. Since |g n ) isn’t an eigenstate 
of p (since [q,p] = i) we just expand |g n ) in terms of momentum eigenstates in the 
usual way |<j n ) = f j^\p){p\q n ) = f dp\p)e~ lpqn /(2tt) 2 . Replacing \q n ) in this way 
gives us 


G n = 


J 

I 

I 

I 


dp 

( 2 tt) 

dp 


r( 9 n+i|p)e ‘2 


„-iV(q n )At 


~ i i^ At p- i P'ln p -iV(qn)At 


a ip 9 n + lg-i^j 2 \t -ipq„ ~lV(q n )At 


(2tt) 
dp 

wy 

-g n ) c -iV(q n )At 

(27r) 


(23.8) 


We’ve now removed all of the operators in favour of numbers and are left with an 
integral over p to do. This is a Gaussian integral which can be done exactly (see the 
next section 2 ). The result is 


= / -i m 

n V 2 tt At 


i m (Qn + l-Vn) 

e 2 St 


±-iV(q n )At 


(23.9) 


Inserting back into eqn 23.5, and writing the factor (—im/2-7rA£) 2 = £ 1 , we find that 
the propagator amplitude G can be written as a product of these mini-propagators 


G = 


N-l 

n 

n=l J 


d qn 

T 


(<?n 


1 —Qn) 


(At ) 2 


p -iV(q n )At 


(23.10) 


Lastly we take the limit N —> oo, At —> 0. This corresponds to upping the contrast 
on our grid in Fig. 23.2, turning the jagged trajectories into smooth curves. Taking 
these limits turns into q 2 and y^ n A t —> J d t. _ 


We obtain our answer for the total amplitude 

G = y'r»[g(t)]e i/dt [ 2S ^" y(9) ], (23.11) 

where we define the integration measure J V[q{t)\ as 

/!>[,(()]= lim It /x’ (2312) 

J n—1 J S 

Equation 23.11 is an example of a functional integral, a sum over all 
possible trajectories encoded as a vast multiple integral over all time- 
sliced coordinates q n , and in fact as we let IV —> oo the functional integral 
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becomes an infinitely multiple integral. The curly T> hides the guts of 
the functional integral: the instruction to integrate over all possible 
trajectories. 

Noticing that the Lagrangian is L = mq 2 / 2 — V{q) and restoring h , 
we recognize 

G = J V[q(t)]e*f dtL bW = j V[q(t)}e iS/h , (23.13) 

which is what we set out to justify. To recap: we calculate an amplitude 
by adding up contributions e lS / h from each possible trajectory, of which 
there are an infinite number. 3 

What does the path integral tell us about quantum amplitudes? Al¬ 
though we add up all of the paths, each path is weighted by a phase 
factor, and we expect that there’s some degree of cancellation between 
the contribution from different paths. Also, as we approach the classical 
world (mathematically achieved by sending H —> 0) we should recover 
Lagrange’s result that the trajectory the particle takes is the one for 
which the action is stationary, i.e. 6S/Sq(t ) = 0. This can be under¬ 
stood by considering a particular trajectory with action S, typically 
large compared to h, so that its contribution is weighted by e lS ^ h (the 
phase S/h can be anywhere on the Argand clock). A close neighbouring 
trajectory will have a different action S 1 , also large compared to h , and 
if SS = (S' — S) h it is likely to have a completely different phase. 
It doesn’t take much imagination to realize that a bunch of such tra¬ 
jectories will all have essentially random phases and will cancel, giving 
no net contribution. In contrast, trajectories sufficiently close to the 
stationary value that Lagrange would have predicted (some are shown 
in Fig. 23.3) all have similar actions and thus similar phases, giving a 
strong in-phase contribution. Therefore, the sum is dominated by the 
stationary result. The quantum mechanical effects, known as quantum 
corrections or quantum fluctuations, are given, for the most part, by 
the trajectories close to the classical path of least action. 4 


23.2 The Gaussian integral 


We’ve outlined a new way to think about quantum mechanics, but it’s 
not worth much if we can’t actually do a path integral. In this section 
we’ll build up a mathematical armoury to deal with evaluating path 
integrals. It turns out that it’s all related to a simple Gaussian integral 
and we’ll build it up via a series of successive steps of generalization. 
Step 0: The basic Gaussian integral around which this section is based 
is given by the standard integral 5 



(23.14) 


Step 1: The first generalization involves rescaling x 2 —» ax 2 /2 to get 



(23.15) 


3 

Doing the sum relies on the shady 
business of defining an integration mea¬ 
sure f T>[q(t)] which doesn’t look very 
well behaved in the limit N —> oo. 
Although this is a legitimate concern, 
we’ll see that we’ll always be able to 
manoeuvre ourselves out of the trouble 
caused by f "D[q(t)]. 



Fig. 23.3 The classical path is shown 
as a solid line, alternative paths allowed 
by quantum mechanics are shown as 
dashed lines. 

^An approach to path integral calcu¬ 
lations where we find the classical tra¬ 
jectory and then find the approximate 
form of the quantum corrections, is 
known as the stationary phase approx¬ 
imation and is used in Chapter 50. 

5 Write I = f°° dxe~ x2 and I = 

J —oo 

f^° dy e~ y , then multiply the two 
equivalent integrals together to obtain 

1 2 = r r dxdye-^ 2 +y 2 ) 

J x=— oo J y= — oo 

r 27t poo 2 

= / d 0 dr re~ r , 

Je =0 Jr =0 

where in the last equality we’ve 
switched to polar coordinates. These 
are standard integrals, yielding 1 2 = 
(27r) x ^ = 7T and hence I = y/ir. 
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^For simplicity here we label ordinary 
vectors in Euclidean space with indices 
in the down position and assume a sum 
over repeated indices. We will return 
to our usual vector notation in the next 
chapter. 

This is nothing to worry about as 
we’re used to doing multiple dimen¬ 
sional integrals over volumes or sur¬ 
faces. Remember that the tricky thing 
about these integrals is the dependence 
of one component on another. Things 
are a lot simpler if we can separate the 
components out. That’s exactly what 
we’re going to do here. 


Step 2: A very important variation, which turns out to be the useful 
one for evaluating path integrals, is the integral: 


dxe 


— OX -J_^ x _ 


2ir bl 
e 2 « 
a 


(23.16) 


which is also not too difficult to prove, as shown below. 


Example 23.2 

Equation 23.16 is proved by ‘completing the square’. The maximum of P(x) = 

2 ,2 I 

— —b bx is Pmax = at x = -. We rewrite 


P(X) - Pi 


max — 


ax* 
~2~ 


— -- I x - 

a 


b 2 

■ bx - 

2a 
2 


(23.17) 


If we now change variables y = (x — so that dx = dy, then eqn 23.16 becomes 




drr e 


- -\-bx _ 


/: 


d * e -§(*-!) 2 + ^ 


si r 

2 a 

J 00 


d ye 


27T b 2 

e?a . (23.18) 

a 


There are some things to note about this equation for later use. Most 
of them are obvious, but important. 

• Since x was our integration variable it doesn’t appear on the right- 
hand side. We say that we have ‘integrated out’ x. 

• We are left with an expression which has the structure 

J dx e~ x % x+bx = (23.19) 

where on the left-hand side a touches two x’s and b touches one, 
while on the right-hand side the two 6’s are linked by the ^. 

We’ll see this structure again a little later. 

Step 3: In order to be able to integrate over functions like q(t), we 
have to consider an intermediate step: integrating over vectors. This is 
because a function can be thought of as an infinite dimensional vector. 
We start with an fV-dimensional vector x, with components 6 Xj and 
consider the integral 

1= J d Xl dx 2 ...dx N e-^ XiAi i x * = /d^e-i xTAx , (23.20) 

where A is an N x N, real, symmetric matrix.' 



Example 23.3 

To evaluate the integral we make an orthogonal transformation A = O t DO, where 
we choose our matrices O in such a way as to make D a diagonal matrix. Our integral 
is then 

X = j d ff ie _ 5 xT ° TDOx . (23.21) 

Next make the transformation Ox = y, and since the matrix O is orthogonal, the 
Jacobian of this transformation is 1 so f d N x = f d N y, and we get 

X = j d N ye-^ yTr,y . (23.22) 

As D is diagonal, we have y T Dy = JT Dn{yi ) 2 , which allows us to separate out our 
multidimensional integral. 

jd N ye-i yTDy = 


where we’ve used Da = det D = det A. 


/ 


dyi e 


Dnivir 


J 


dy 2 e 


. d 22(V2 ) 


-/ 


dy N e 


( 2?r \ 2 / 2-k \ 2 / 2tt \ i 

V^ii/ \D22J \Dnn) 


n(£r = 


D 


(2n) N 

dctD 


(2ir) N 
det A 


. d nn(vn ) 2 


(23.23) 


Step 4: We’re finally in a position to generalize our important result, 
eqn 23.19, to the multidimensional case. We want to evaluate the inte¬ 
gral 

K = j d Jv a;e-5 xTAx+bTx , (23.24) 

where b is an A^-dimensional vector. 


Example 23.4 

By analogy with Example 23.2 we consider P(x) = — ^x T Ax + b T x. We can easily 
find its minimum (remembering that Aij is symmetric and using 

= --SikAijXj - -XiA i:j 5 jk + bjS jh 

— AkjXj T bk 

= —(Ax — b)^ = 0. (23.25) 

This gives us a minimum of P m i n = b A 2 —— at x = A -1 b. So again, just as in 
eqn 23.17, we can write 

. 1 b T A- 1 b 

P(x) - P min = --x t Ax + b T x--- 

= -^(x-A-^Afx-A-'b). (23.26) 

This all looks familiar, of course. Again, we can perform a transformation and 
integrate over our new variable y = (x — A -1 b). We’ll obtain a factor of the form 
e 2 emerging at the front. 
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The previous example leads us to 


K = 


l N xe~^ 


1 Ax+b 1 


( 2 ^ 

detA 


1 

2 


e 2 


b T A“ 1 b 


(23.27) 


Again we note that we’ve integrated out x, and while A touches two x’s 
on the left-hand side, A -1 takes b to b 1 . 

Step 5: Functions can be perfectly well described as infinite dimensional 
vectors, and so our next generalization is the functional integral 


Q 



f dxdy f(x)A(x,y)f(y)+f dx b(x)f{x) 


(23.28) 


To deal with this we’ll revert to the discrete world by turning the func¬ 
tions into iV-dimensional vectors; we’ll write down the solution and then 
upgrade the result back into the world of functions. 


Example 23.5 

We convert A(x,y) —► A, f(x) —> x and so on to obtain 


I T>[f(x)\ e _ i I dxdy f( x ) A ( x ’y)f( y )+f dxb ( x )f( x ) _> J d N xe~ i xT Ax+b T x 


det A 
(27r) 


e 2 b T A ~ 1 b 


(and then returning to functions) 


N \ 1/2 


det A(x, y) 


f dxdyb(x)A 1 (x,y)b(y) 


(23.29) 


We have the final result that 

V[f{x)] e ~3 f dxdy f(x) A {x, y )f{y)+f dxh(x)f(x) 

= B[detA(x,y)}~ 1/ \ 1 ^ dxdvHx)A ~ 1( - x ’ yWv \ (23.30) 

where we’ve introduced a constant B to soak up the factor of (2^)^ 
(which is a potential embarrassment in the limit N —> oo). 

The inverse function A _1 (x,y) is that function that satisfies 



8 The key path integral equation 
(eqn 23.30), which we’ll use most fre¬ 
quently in later chapters, may be sum¬ 
marized as 


J V[f}e-^S fAf+ S bf = Ne^ bA lb , 


where A is a constant. 


J d zA(x,z)A 1 (z,y) = S(x — y). 


(23.31) 


That’s just another way of saying that A~ x {x, y) is the Green’s function 
for the operator A(x, y). It’s been a long slog, but we can now do one sort 
of functional integral, 8 as long as we can find the determinant det A(x, y) 
and find the Green’s function of A(x,y). In the next section, at long 
last, we’ll finally do a functional integral! 
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23.3 The propagator for the simple 
harmonic oscillator 

An illuminating example of the path integral approach is the simple 
harmonic oscillator. We’re going to find the probability amplitude that 
if, at f a = 0, we put a particle in the minimum of a harmonic potential 
located at q = 0, when we come back at time % = T the particle will 
still be at q = 0, where we originally put it. We need, therefore, to 
compute the propagator 


G{q h = 0, t h = T , q a = 0, t a = 0) 


Particle starts at 

<7a = 0, t a = 0 


Particle ends at 
<7b = 0, t h = T 

I V[q{t)]e^o ^L[ q (t))_ 

<Za=0,*a=0 


(23.32) 


Let’s do the integral. The Lagrangian for a simple harmonic oscillator 

is L = mq ^ - "'" J ° 2 g(,) , where wo = ( K/m ) 1 ^ 2 . The path integral is 

therefore 

G = J V[q{t)]e^ Idt i^^ - "“ a(t)a ]. (23.33) 

Unfortunately this is not in the form of the one integral we know how 
to do (that one looks like f Vqe~^ qAq+bq ). Can we massage the simple- 
harmonic oscillator into this form? We can and the trick, which is based 
on integrating by parts, is one that we’ll use repeatedly for path integrals. 


Example 23.6 

The action in the argument of the exponential part of the path integral is 


S = 




mu>Qq(t ) 2 

2 


(23.34) 


and the trick just involves integrating the first term, which represents the kinetic 
energy, by parts 


giving 



(23.35) 


(23.36) 


The boundary conditions (q = 0 at the beginning and end of the trajectory) wipe 
out the first term and the second term has the form ‘ qCq\ where C is an operator. 


The use of our neat trick gives the functional integral 
G(0, T, 0,0) = f V[q{t)]e^ dt ^ t)dq(t \ 


(23.37) 
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with C = uz (—2 — Wq). We can do this integral using a slightly 

modified form of the result in eqn 23.30. Setting A(x,y ) = —i C and 
b(x) = 0 in eqn 23.30, we discretize the integral to obtain 

G —> [ (\(ji dq-2 ■ ■ ■ dqN e 2 liCij'H _ Z' ) . (23.38) 

J \ det C / 

Again this fills us with a sense of dread since the result is quite frighten¬ 
ing with its factor (2^)^ that won’t look pretty as N —» oo. Undaunted, 
we’ll assume that when we pop back into the world of functions the an¬ 
swer will at least be that —> G <x [det C(x, y)] -1 / 2 . Admittedly, the 
determinant of the differential operator C(x,y) might also send a chill 
down the spine. To give such an object meaning, we can find the eigen¬ 
values (and eigenfunctions) of the operator and use the fact that the 
determinant is given by the product of eigenvalues. Our answer is then 

(23.39) 

where B is some constant coming from the integration measure that 
we’ll worry about shortly. 


G(0, T, 0, 0) = 5 det 


d 2 


Example 23.7 

The next job is to work out the determinant. This we do with the trick described 
above that makes use of the fact that the determinant of a matrix is equal to the 
product of its eigenvalues. Finding the eigenfunctions of the operator C isn’t too 
tough either. You can check that m q{t) = \ n q{t) has eigenfunctions 

q(t) = csin(w n £), with eigenvalues A n = — cJq), of which there are an infinite 

number. Remembering that our boundary conditions are that q = 0 at t = 0 and 
q = 0 at t = T we need u n = nn/T for everything to work. Putting this all together, 
we have an answer 



(23.40) 


At this stage we notice that the answer isn’t very useful in its present form. This is 
because the infinite product looks like it gives infinity whenever cjoT = mr and we 
don’t know what B is yet. 

A sanity check is useful here. When there’s no spring constant cjo = 0, there’s no 
harmonic potential and we have a free particle. In that case we have 


Gu,=o(0,T,0,0) = B 



(23.41) 


but we already know (from Example 16.5) that for a free particle Gf ree (0, T, 0, 0) = 


f —i m \ 2 
, 2t tT ) • 


This will be enough to provide the proportionality constant B for our 


problem. 
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We multiply eqn 23.40 by Gf ree /G„ 0= o = 1: 


G(0, T, 0,0) = G(0, T, 0, 0) 


Gf ree (0, T, 0, 0) 


G„ o=0 (0,T,0,0) 

B{nn=i™[m 2 -“Z]Y 


B 


n 


{Un=l 


1-f^Y 


( —V 

V2 t tTJ 


/ —i m\ - 
V 27rT ) 


(23.42) 


which removes all of the dangerous terms. Finally we use the identity j [jbj [ 1 — 
( x/nir ) 2 ] -1 = xj sin a; and write 


_ / uqT \ 2 / —im\ 


G(0, T, 0, 0) = ( -4- - 

\sinmoT' / \2nT J 


(23.43) 


The final answer is that 

G(o,r,o,o) 


—imcoo 
2ir sin loqT 


(23.44) 


and the probability density |G| 2 derived from this is shown in Fig. 23.4. 
Interestingly, the probability density peaks strongly for T = mr/uio, 
where n is an integer, making it very likely that the particle will be 
found at the origin at these times. Perhaps this shouldn’t come as too 
much of a surprise: we know that classical particle oscillates back and 
forth in an harmonic potential. From the path integral point of view, 
the peaks in probability density result from the constructive interference 
of all of the possible particle paths that start and end at the origin in a 
time T; while the minima result from a large degree of cancellation of 
such path sections. It’s as if the paths at the origin go in and out of 
focus as a function of time! 


Chapter summary 

• Feynman’s path integral approach involves finding amplitudes by 
adding up contributions from all possible trajectories. Each tra¬ 
jectory makes a contribution exp(i S/K). 

• One type of functional integral has been evaluated and yields the 
result f V[f]e~i Sf A f+f b f = ^ ^etA ) ~ e^ bA 1& . 

• The simple harmonic oscillator provides an example showing the 
infinities that have to be tamed in a typical calculation. 


10 



0 12 3 4 

u) 0 T/tt 

Fig. 23.4 The (scaled) probability 
density |G(0, T, 0, 0)| 2 as a function of 
time for a particle that starts at the ori¬ 
gin of a harmonic oscillator potential to 
be found there a time T later. 





220 Path integrals: I said to him, ‘You’re crazy’ 


Exercises 


(23.1) A back of the envelope derivation of the most im¬ 
portant result of the chapter. 

Start with a Lagrangian L = |xAx + bx, where A 
is an operator. We will be brash and treat A as a 
number. Use the Euler-Lagrange equation to find ^3 3 ) 
x and show that the Lagrangian may be expressed 
equivalently as L = ~bjjb. 

(23.2) The path integral derivation of Wick’s theorem. 

(a) Calculate dxx 2 e~^ ax . 

(b) Define 

r da; x n e 

J — OC 


— 77 ax 


(x n ) = 


da; e 

J —OO 


(23.45) 


Calculate (i) (a; 2 ), (ii) (x 4 ) and (iii) (x n ). 

(c) Make sense of the result for ( x n ) diagrammati- 
cally, representing each factor of - as a line linking 
factors of x. 

(d) Now consider the A-dimensional vector x and 
the integral 


K. = 


dxi ... dxN e 2 


{2tt) n 
det A 


— )x T Ax|b T x 


(23.46) 


By differentiating with respect to components of the 
vector b, and then setting b = 0, show that 

, f dxi ...dxNXiXje~5 xTAx x 

{xiXj) = --- 1 TA _ = (A 


f dxi... dxjv e 


ij. 

(23.47) 


(e) Using these results, argue that 
( XiXjXkXi } (A )ij(A ) ki Y (A )jfc(A )j; 

+(A- 1 ) i ,(A" 1 )^. (23.48) 


(f) Write down an expression for the general case 
(Xi . ..x z ). 

This is the basis of Wick’s theorem in the path in¬ 
tegral approach. 

Consider the forced quantum oscillator from Exer¬ 
cise 22.1 


L = -mx(t) 2 — -mu> 2 x(t) 2 + f(t)x(t). (23.49) 

Take f(t) to be a constant force /o which acts only 
for a finite length of time: 0 < t < T. 

(a) Show that the amplitude for a particle that is in 
the ground state at t = 0 to be in the ground state 
at t = T is given by 


_ g-j J dt'dtf(t)G(t,t')f(t') 


where 


Oft - f')e“ i “ (t-t ' ) + 9(t' - t)e iu ' (t_ * ,) 
2 mui 


(23.50) 


G(t,t') = 

/./rn.ti ; 

(23.51) 

(b) Carry out the integral (being careful of the lim¬ 
its) and show that the amplitude is given by 


A = exp 


ig^( T _™^ + i 2 smS ^\ 

Amur 1 \ u) ui 2 J 


(23.52) 

(c) What is the probability for the oscillator to still 
be in its ground state at t = T? Compare this with 
the result of Exercise 22.1. 

(d) What is the physical meaning of the imaginary 
part of the argument in the exponential? 
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Life is not a walk across a field 

Russian proverb, quoted by Boris Pasternak (1890-1960) in 
his poem Hamlet 

In this chapter we will apply the functional integrals of the last chapter to 
quantum fields. Remarkably, the generating functional Z[J\ for quantum 
fields can be written as a functional integral. In this chapter we’ll find 
that integral. Using functional integrals can be seen as an alternative 
to canonical quantization and the 5'-operator (as shown in Fig. 24.1) 
although it’s best to have the two approaches available since both have 
their strengths and weaknesses. 
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24.1 The functional integral for fields 

In the previous chapter we calculated a propagator amplitude for a par¬ 
ticle by doing a functional integral. This involved summing all possible 
trajectories for a single particle travelling between two spacetime points. 
Our result was that 

G{x,t x ,v,ty) = (24.1) 

However, we know that single-particle quantum mechanics isn’t enough 
to describe reality: we need fields. We therefore want to find a functional 
integral that sums all possible field configurations that can exist between 
the two spacetime points. The bad news is that this integral won’t yield 
a propagator. In the next example we’ll pursue a method of integrating 
over field configurations and in the following section we will show (the 
good news) how this leads directly to the generating functional. 


Example 24.1 

The jump between integrating over particle trajectories to integrating over field con¬ 
figurations is quite straightforward. We can still write down the e lS / h factor for 
fields, but now the action S is the integral of the Lagrangian density over space- 
time: S = f d^x C[<ft(x)]. Note that the fields appearing here are classical Heisenberg 
fields. Their time dependence is given by the full Hamiltonian and they haven’t 
been through a canonical quantization machine so have no commutation relations. 
A functional integral over fields may then be written as 

j v[4,{x)\ e iS --o dixC[ ^ x)] . 



Fig. 24.1 Using functional integrals to 
do quantum field theory. 


(24.2) 
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Here’s the recipe to revert back from 
continuous functions to a discrete lat¬ 
tice: 

• The field </>( x) evaluated at a 
spacetime point x becomes an 
AT-component vector qj , where 
j = 1,... ,N labels a point in 
spacetime. 

• Differential operators become 
real symmetric matrices. Things 
like ^ become ; - qj) = 

Aij qj. 

• Integrals become sums: f dx be¬ 
comes l . 


The difference with the path integral that we had before is that the dynamic variable 
is now the field (f){x ) rather than the trajectory q(t) and the integral is therefore over 
functions of position and time rather than just the functions of time. Despite this 
difference, we do the integral in the same way as before: we discretize. Previously we 
just sliced up time, now we slice up spacetime. We obtain a multiple integral over N 
spacetime variables. 

We know how to do the integral for the special, but important, case that the 
Lagrangian can be worked into the form L[q] = Aq + i b T q. Based on the 
observation that 


I 


dxeh ax2 + ibx 



(24.3) 


we can read off that the answer is 


V = j d Jv ge5 qTAq+ibTq = 


( 271 ~ i) 

detA 


N\ 2 


(24.4) 


Our important result for a calculable functional integral over fields is that 

,} / d 4 xd 4 y <j>(x)A(x,y)<j>(y)+i f d 4 x b(x)</>(x) 


!■ 


T>[(f)(x)] e 2 • 

= B [detA(x,y)\~^ e - 5 f d4xd4 v b ( x ) A ^ 1 ( x ’V) b (v) } (24.5) 

where we’re hidden the factor (2^1)^ in the (divergent) constant B as we did before. 


24.2 Which field integrals should you do? 


We’re now at the stage where we’d like to have a go at doing a functional 
integral over some fields. Before getting too excited about all of this, we 
need to decide which integrals are worth doing. What are we trying to 
learn? Which fields do we start at, and which do we end with? 

We want access to all of the Green’s functions for a theory, since these 
contain all of the physics. If we only want to work out one thing, then 
it should be the generating functional Z[J\. We saw before that this 
functional contained all of the Green’s functions and therefore all of the 
information for the theory. Remember that the generating functional 
Z[J] tells us the amplitude 


Z[J\ = 


no particles 
at x° = (X) 


no particles 
at y° = — oo 


(24.6) 


^Also remember that our goal is ac¬ 
tually the Green’s functions. To 
get to these we can take func¬ 
tional derivatives of the generat¬ 
ing functional i n G^ n \xi, ...,x n ) = 

zfjJ 6J(xi)...SJ(x n ) | J=0 ( ec l n 21 - 10 )' 
o 

Note that this is the unnormalized 
generating functional. Recall that to 
obtain the normalized generating func¬ 
tional we can simply divide through by 
Z[J = 0] since Z[J\ = Z[J]/Z[ 0]. 


that is, in the presence of a source, we start with no particles and end 
with no particles. 1 The remarkable answer turns out to be that the 
generating functional can be written in terms of the functional integral 

J Aix{c[ ^ x)]+J{x)rl>{x)} . (24.7) 

In order to go about calculating Z[J], we start by splitting the La¬ 
grangian up into two parts £ = Co + Cj. As usual, £o is the free part: 
it’s the part that can be cast in the quadratic form f>(x)A<f>(x ) and is 
therefore solvable by our one known field integral. The other term C\ is 
the interaction part: that is, the part which isn’t solvable by canonical 
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quantization. Written in terms of functional integrals, the generating 
functional is therefore 

Z[J] = J V[(j)(x)}e l $ dix ( Co+Cl+J ^. (24.8) 

Now let’s try to do the integral. 

24.3 The generating functional for scalar 
fields 

We’ll calculate a generating functional for the free scalar field. Following 
the custom that quantities involving free fields are given a subscript ‘0’ 
this will be denoted Z 0 [J\. The free Lagrangian is given by 

1 777 ^ 

£o = -(^) 2 -—^ 2 . (24.9) 

The generating functional Zq[J] for the free scalar field is 

Z 0 [J] = 

As we’ve said above, we can do the integral if we massage Z 0 [J] into 
the form of eqn 24.4. We again invoke the neat trick of doing by parts 
the integral of the kinetic energy part of the Lagrangian density in the 
argument of the exponential (e j -f d * ) : 

J d 4 x {d^f = j d 4 xcj)d (24.11) 

where we assume that the first term disappears at the boundary, that is 
the field dies off as we head out to spacetime infinity. This trick enables 
us to make the replacement / d 4 £ (d^cf) 2 —■► — / d 4 a :<j>d 2 (f> and allows us 
to rewrite the functional integral as 

Z 0 [J] = J S Aix *\-~ (d2+m2) \ 4 ’ + ' 1 S dixJ<1, , (24.12) 

which is in the form of the one field integral that we know how to do. 3 
We have that A = — (9 2 + m 2 ) and our answer is that 

Z 0 [J] =B[detA(x,y)]-*e-^ dixdiyJ(x) l iA - 1{x ’ y) } J{y \ (24.13) 

where B is a (potentially infinite) constant determined by the integration 
measure. The determinant also looks worryingly infinite! 

Normalization will save us here. The normalized generating functional 

is Z 0 [J] = Z[J\/Z[J = 0], where Z 0 [J = 0] = B det A(x,y) 2 . This 
removes two divergent quantities (B and the determinant) at a stroke. 


Q 

We will see that this is telling us that 
a Lagrangian that can be massaged 
into this form may be quantized using 
functional integration. The family of 
quadratic and bilinear Lagrangians for 
which this is true are exactly those that 
may be diagonalized by canonical quan¬ 
tization and are conventionally called 
non-interacting. 


Recall that the normalization ensures 
that the amplitude to start and end 
with no particles, in the absence of a 
source, is unity. 


(24.10) 


Note that in eqn 24.10 and later we’ve 
suppressed the spacetime variable x to 
avoid unnecessary clutter. 
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Finally, we recognize A(x,y) = — (d 2 + to 2 ) as an operator giving 
rise to the Klein-Gordon equation of motion and note that its inverse 
determines the free scalar propagator via 

, , r d 4 n ip-ip-(*-i0 

iA 1 (x,y)= A(x,y)= / -y—-■ (24.14) 

J (z7t) 4 p z — m z + le 


Example 24.2 

To check this, we recall that the inverse of an operator satisfies AA~ 1 = 7, which in 
the continuum limit becomes 

— (d 2 + m 2 )A~ 1 = S ( ' 4 \x — y), (24.15) 

where the d 2 operator acts on the x variable. Next we note that we defined A (a;, y ) 
via 

— (d 2 + m 2 )A(ai, y) = i 5^ (x — y), (24.16) 

so we can identify L4 _1 = A (x,y), that is, the Green’s function of the equation of 
motion, also known as the Feynman propagator. 


Putting all of this together gives us a normalized generating functional 

Notice that the field <p doesn’t appear 
on the right-hand side of eqn (24.17). 

We say that we’ve ‘integrated out’ the 
field (j)(x ). 


for the free scalar held of 


Zo[J\ = 


J T>cj)e^ f dix H-( a2 + m2 )}« !, + i f d4x J<t ’ 


j /4 4 x0{-(O 2 +m 2 )}d 

= e -5 /d 4 rcd 4 y J(x)A(x-y)J(y). 


(24.17) 


Example 24.3 

We now justify that eqn 24.17 is the same generating functional for the free scalar 
field that we defined in Chapter 22. The Chapter 22 definition of the generating 
functional may be written 

OO . yi, p 

Z[J] = d 4 zi...d 4 x„ J{xi)...J(x n )(n\T4> H (xi)...4>H{x„)\n). (24.18) 

n =0 

Note that for a non-interacting field we have <j >h = e lHot (f)e~ lHot = cf)i and so we can 
use Wick’s theorem (which applies to freely evolving interaction picture fields only) 
to expand any time-ordered products. They fall apart into a sum over products of 
free propagators. For example, the n = 4 term involves a product 

(0\T<f>i(xi)4>i(X2)<t>l(X3)<t>i(X4)\0) =A(xi - X2)A(X3 ~ X 4 ) + A (®1 - X3)A(X2 ~ X 4 ) 

+ A(rri — X 4 )A(x 2 — X 3 ). 

Renaming variables, we see that each of the three terms will therefore contribute a 
factor f[J(xi)A(xi — X 2 )J(x 2 )] 2 ■ This enables us to write the fourth-order term as 

i 4 7 „ 

— I d i Xi...d A X4,J(x\)...J(x4,)(Q,\Ttj>' Si {x- i _)...<j) W _(x4)\^) 

d A x\d A X 2 J{xF)A.{xi ~ X 2 )J{x 2 ) S j . (24.19) 

Repeating this process for other values of n generates the corresponding terms in the 
exponential and the answer indeed reduces to eqn 24.17. 
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The point of finding the generating function is to give us access to prop¬ 
agators. Specifically we have for free fields that the propagator is given, 
in terms of the normalized generating functional, by 


G^ l \x 1 , ...,x n ) 


S n Z 0 [J) 


SJ(xi)...SJ(x n ) 


j=o 


S n Z 0 [J] 


Z 0 [J = 0] 5J(xi)...5J(x n ) 


j=o 


(24.20) 


We’ll evaluate this in two different ways for the single-particle propagator 
Go(x,y). Differentiating the expression for the functional integral 2o[J] 
with respect to the J’s gives us 


Go(x, y) 


f V(j) cf>(x)(/>(y)e 1 dixC °^ 1 
f V^e'-T dixC °W 


(24.21) 


while differentiating the expression for the normalized generating func¬ 
tional Z 0 [J] = e“s/ d4 * d4 y J{x)^{x,y)J(y) gj ves us the expected answer 
Go(x, y) = A(x, y ). Thus the Green’s function, which we wrote down as 
(fi|T</>H(a;)(/>H(2/)|^)) is obtained by integrating two fields weighted by 
a factor e 1 ^ d which tells us how the amplitudes of different field 

trajectories are distributed. 4 


Example 24.4 

We’ll apply what we’ve learnt to the interesting case of the free massive vector field 
theory from Chapter 13 described by 


= - i {d tl A v & l A v - dpA v d v A^) + U^A^A^ 


(24.23) 


(see Exercise 13.4 if in doubt). The only complication with this is the different 
components of the A^ field. Note that in this case the result of the one functional 
integral that we can do is written 

P T) A ah f A**Kuv A u +i f d 4 x Jn,A^ 

Za rji = i VAe 2 =c -l/d 4 xd 4 yjM(x)[iX-)]r(y) 

(24.24) 

We can, therefore, only do our functional integral if the Lagrangian is in the form 
| KA u , where is a differential operator (with an inverse). Just as before, 
the way to achieve this is, as usual, to integrate C, by parts. Of the three terms in 
the Lagrangian, the final one is already in the correct form. Integrating the first one, 
and disregarding the boundary term, we get ^A^d 2 A^ = ^ A^d 2 g^ u A u . The second 
term yields —^A^d^dvA 1 '. The result 5 is that 


^K^A" = ? ,1" [(d 2 + m 2 )g M „ - d„d„] AT 


(24.25) 


We therefore do the integral 


Zo[JJ 


f VA ei f dix A * ^+ m2 )■ - d e 8 " 1 A " +i / d 4 x A^ 

fVAe^ f d4x AIJ ’[( d2 + rn2 '>s^-dp.d^] A ‘' 

e -| /d 4 a;d 4 !/ J** (x) J" (s/) ^ 


4 This is just as we have in classical 
statistical physics, where the analogous 
expression for the average of two fields 
is 


(</’!</> 2>t = 




V c-Pni&iAj) 

Z-^ij 

(24.22) 

where the sum is over all configurations 
of (pi and <pj weighted by the Boltz¬ 
mann factor and divided by the normal¬ 
izing partition function. 


c 

You are invited to confirm this result 
in Exercise 24.1. 


(24.26) 
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where iis the free particle propagator for the theory Go^u(x,y). This already 
tells us something very interesting. Namely, that we have access to the free prop¬ 
agator for a theory if we can work out . This is often an easier method than 
working out, for example, Go^ u (x,y) = (0\TA^(x)Al(y)\0) as we would have had to 
have done here using a mode expansion if we were still using canonical quantization. 
Following this through, we can therefore obtain the free particle propagator from the 
following equation (with the indices all included): 

[( d 2 4- m 2 )g^ - d^d"} G 0v \{x,y) = i g^ A \x - y), (24.27) 


and the easiest way of doing this is to take a Fourier transform, yielding 
[-(p 2 - m 2 )g^ +//]G 0 ^W = l 9\i 
whose solution may be shown to be 


Gq u \{p) 


-i ~ fVPA/ra 2 ) 

p 2 — m 2 


(24.28) 


(24.29) 


which (with the addition of ie in the denominator) is the massive vector field propaga¬ 
tor. Interestingly, this has the form of the scalar propagator which we met previously, 
but this time with the transverse projection tensor on top, containing all of the in¬ 
dices. 


Chapter summary 


• The normalized generating functional for free scalar fields is 
Z 0 [J) = e~^ dix< ^ v J ( x ) A ( x ~y) J (.y ). 


Exercises 


(24.1) (a) Verify eqn 24.25. 

(b) Show that eqn 24.29 solves eqn 24.28. 

(24.2) Consider the (j> A Lagrangian C = |(<9 M c)>) 2 — — 

| <f> A . Shift the Lagrangian by adding a new field a 
via 2 

r' = r+^(a-|<)» 2 ) . (24.30) (24.3) 

(a) By performing a functional integral over the 
field a, show that a doesn’t change the dynamics of 
the theory. 

(b) Confirm that a has no effect on the dynamics of 
the theory by finding the Euler-Lagrange equation 
for a and showing that is has no time derivatives. 

This means that it can only provide a constraint and 
may be eliminated, much like the component A° in 
the massive vector field in Chapter 13. 

(c) Contrast the Feynman diagrams of (j> 4 theory 


with those of the shifted theory involving a. 

The introduction of a new field to shift the La¬ 
grangian in this way is known as a Hubbard- 
Stratonovich transformation and is very useful in 
removing the inconvenient <p A term from a theory. 

In this problem we motivate the perturbation expan¬ 
sion for the functional integral. 

We want to do the integral 

Z(J) = J &xe~^ Ax2 -v xi+Jx . (24.31) 

(a) Show that this integral may be recast as 

1 ( A d 4 Y 

n\ V 4! dJ 4 J 

(24.32) 


dxe 


— ^Ax 2 + Jx 


Z(J) = 


E 
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which has the advantage that we’ve effectively re¬ 
moved the interaction from inside the integral, 
(b) Re-exponentiate the series in the first square 
bracket to give a differential operator acting on an 
integral and then do the integral to obtain the result 


Z(J) = 


' A 8 4 ' 


/2tt\ 2 j 2 

e 4! FT 4 


— e 2A 

- 


V A J 


(24.33) 


We now have a perturbation series that can be ex¬ 
panded by operating on the free generating function 
Zo(J) = ( 2 j) h e£. 

(24.4) By analogy with the previous problem, the gener¬ 
ating functional for theory is given by 


Z[J] = 



/ d 4 


S 4 I 

■s-n *) 4 


Zo[J], 


(24.34) 


where 2o[</] is the normalized generating functional 
for the free scalar field. In this problem we investi¬ 
gate the first-order term in the expansion of Z[J], 
given by 

Zi[J] = j_ix jd 4 2 s 4 4 j e ~ i f dixd *v J(x)M*-y)J(v) _ 

(24.35) 

We will act on Zq[J] = e _ 2 / JAJ with s f^ four 
times. This will rely on some functional differenti¬ 
ation from Chapter 1. 

(a) Verify that hitting Z 0 [J] once gives us 


SZ 0 [J] 

8J{z) 


- j d A y A(z — y)J(y) 


Zq [J]; (24.36) 


(b) the second time: 


8 2 Z 0 [J] 
8J(z) 2 


—A (z — z) 


{/ 


+ < d t/A(z - y)J{y) 


(24.37) 


Z 0 [J\; 


(c) the third: 

5 3 Z 0 [J] 

8J(z) 3 


3A (z - z) j^d 4 y A(z - y)J(y) 


{/ 


d yA(z - y)J{y) 


(d) and finally the fourth: 

5 A Z 0 [J} 


5J(z 


= [3A(z - zf 


{/ 


Z 0 [J\- (24.38) 


(24.39) 
2 


—6A(z — z) < / d y A(z — y)J(y) 


+ | J d i yA(z-y)J{y )j 


Z 0 [J\. 


(e) Expand the brackets, multiply by —iA/4! and 
integrate f d 4 z to end up with the final result that 
Z\ [ J] is given by 


Zi [J] = -iA 


^ J d 4 z A(z — z) 2 (24.40) 

d 4 zd 4 j/id 4 i/ 2 A(z - z) 
xA (z - yi)J(i/i)A(z - y 2 )J(y 2 ) 


{/' 


+ jf| J d 4 zd 4 yid 4 j/ 2 d 4 i/3d 4 i/4 A{z - yi) 
*J{yi)A(z - y 2 )J(y 2 )A(z - y 3 )J(y 3 ) 


xA (z - y 4 )J(y 4 ) 


Z 0 [J\. 


We see that this has given us terms in J of order 
O(J) = 0, 2 and 4. 

(f) Interpret this result by drawing Feynman dia¬ 
grams for each term. This procedure can be used to 
work out the Feynman rules for a theory. For more 
detail see Ryder, Chapter 6. 
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It is the mark of a truly intelligent person to be moved by 
statistics. 

George Bernard Shaw (1856-1950) 

We’ve seen how taking the derivatives of the generating functional 
Z[J] = f V(j)e 1 i Aix ^ +J ^ allows us access to the all-important 
Green’s functions of a quantum field theory via 


(n\T(j>(x 1 )...(i>(xn)\ty 


1 1 5 n Z[J] 

i” Z[J = 0] 6J(xi)...6J(x n ) 


j=o 


(25.1) 


1 Usually we employ units where fce = 
ft = 1 so that 0 = 1/T = it. 


This is very similar indeed to the state of affairs in statistical mechanics 
that we examined in Chapter 21. There we found that we had access to 
all of the interesting correlation functions via derivatives of the partition 
function 


,? X \ - 1 d^( J ) 

wv-p.jt Z {j = o)dJ il ...dJ in 


(25.2) 


In fact, our motivation for searching for a quantum field generating func¬ 
tional was simply the ease with which calculations in statistical physics 
can be made with a partition function. The similarity between the two 
disciplines is not just a superficial resemblance but reflects something 
much more fundamental. The crux of the matter is that the Green’s 
functions of quantum field theory are related to time-evolution opera¬ 
tors, which (roughly speaking) look like a complex exponential 
with E an energy and t the time. The probabilities (and density ma¬ 
trices) of statistical physics are based on the Boltzmann factor, given 
(roughly) by the real exponential e~ E / k ^ T . The fundamental elements 
of the two theories are therefore both exponentials, albeit one is real and 
the other is complex. This similarity can be exploited by making the 
substitution 

i t i 

(25.3) 


it 

h 


l 


knT 1 


and the claim is that this will map a quantum field theory on to a 
statistical field theory. The advantage of this mapping is that it allows us 
to transfer results from one subject directly to the other, and vice versa. 
Within the framework of the mapping, inverse temperature 1 (3 = 1 jk-^T 
in statistical physics behaves like imaginary time in a quantum field 
theory. It turns out that the concept of imaginary time is a remarkably 
powerful one with uses across quantum and statistical field theory. We 
examine the imaginary time formalism in the next section. 




25.1 Wick rotation and Euclidean space 229 


25.1 Wick rotation and Euclidean space 


Imaginary time, assigned the symbol r = it, is defined, not simply by 
multiplying time by i, but by making a rotation in the complex plane 
by an angle 2 — 7r/2 (Fig 25.1). This transformation is known as a Wick 
rotation and leads to a very nice feature for spacetime four-vectors. By 
rotating the tinre-like part of our vectors we essentially remove the an¬ 
noyance of the Minkowski metric and replace it (to within 

an overall minus sign) by the more civilized (+,+,+,+)• This means 
that all four components of our vectors are treated equally, in the same 
way that all spatial components are treated equally when doing Eu¬ 
clidean geometry in three dimensions. We refer to this four-dimensional 
space as Euclidean space. 


Example 25.1 

Let’s work through the consequences of this transformation. To make the Wick 
rotation to Euclidean space we will rotate the time coordinate by — 7r/2, defining 
xq = —it. This means that 

x 2 = Xq — \x\ 2 = -(r 2 + |a;| 2 ) = -x|, (25.4) 

where = r 2 + x 2 . Note that it also implies that 

d^x = — id 4 a?E = — idrd 3 #. (25.5) 

We still want things like Fourier transforms to work in Euclidean space. The re¬ 
quirement that the exponential factor e ip x will not blow up requires us to rotate the 
energy part of the momentum vector po by n/2 in the complex plane. We therefore 
define pQ = iu;, giving 

P 2 = Po - \p \ 2 = -(^ 2 + IpI) 2 = -pi- (25.6) 


o 

Positive angles correspond to anti¬ 
clockwise rotations in the complex 
plane. 


Euclid of Alexandria was a Greek math¬ 
ematician who lived around 300 BC. 
He is best known for his Elements , the 
textbook which laid the foundations 
for geometry and has influenced gen¬ 
erations mathematicians for more than 
two thousand years. 



We can also write the element of four-volume in momentum space as 

d 4 p = id 4 pE — ida; d 3 p, 
and the inner product between p and x as 


p • x = poxq — pa j = cor — p ■ x. 

We also consider the derivatives. We have that do = id r and 
d 2 = dl - V 2 = -(d 2 + V 2 ) = -<9|, 
where d^ = d 2 + V 2 . Finally we need to note that 

09 u<t>? = - [( d T <fi ) 2 + (V<)>) 2 ] = -(9e m 0) 2 . 


(25.7) 

(25.8) 

(25.9) 
(25.10) 


Fig. 25.1 The Wick rotation involves a 
rotation of — n /2 in the complex plane. 
We also show the poles of the Feynman 
propagator, demonstrating that Wick 
rotating of the contour of integration 
from the real axis (integrating from 
— oo to oo as we do in calculating Feyn¬ 
man amplitudes) to the imaginary axis 
(integrating —ioo to ioo) is permissible 
as we don’t cross any singularities. 


To summarize the previous example: 

The rules of Euclidean space 


po = icu p = 

Xo = —IT X 2 = 

do = id T d 2 = 


-( W 2 

-( t 2 + X ' 

-(a 2 1 


p 2 ) = 

2 


= —x 


Pe 

,2 


V^) = 


-91 


d 4 p = idw d 3 p = id 4 pE 
d 4 cc = —idr d 3 a: = —id 4 a;E 

( a ^) 2 = - [( a r ^) 2 + ( v ^) 2 ] 

(25.11) 
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3 

Note that we often write 
Ce[4 >\ = j(9t0) 2 + U[<t>\, where 

U[<t>\[= i(V0) 2 + ^</» 2 + is 

the potential energy density. 


Example 25.2 

Our goal is to see the effect of the Wick rotation on the path integral. We start with 
the </> 4 theory with sources for which we have 




- ^</> 4 + J<t> 


(25.12) 


§(^ r<t>? + ^(V# + ^-0 2 + - J<$> 


(25.13) 


iS = i <5rx 

L 2 ' 

Making the rotation results in the expression 3 
i S = — j d 4 ^E 
Defining 

Se = J d 4 £E (£ E [</>] - j<t>) = J d 4 z E 

we write a Euclidean path integral 

Z[J] = J T><t> e - SE = j V<t>e~ . (25.15) 


-(drip) 2 + -(V0)“ + —fi 2 + ~ J<t> 


(25.14) 


This is a useful mathematical object as it stands since it makes the path integral 
somewhat more respectable. As we shall see in the next example, it removes the 
ambiguity in defining the propagator which forced us to introduce those factors of ie. 


^We drop the subscript ‘E’ on position 
and momentum variables in Euclidean 
space from this point. 


Example 25.3 

This Wick rotation may be carried out on some of our most useful equations to 
express them in Euclidean space. The Feynman propagator for free scalar fields in 
Minkowski space is 


A(®) = / 

d 4 p ie- 1 ? 1 * 

(25.16) 

(27t) 4 p 2 — m 2 + ie ’ 

which is the solution to the equation 



( d 2 + m 2 ) A(: 

w 

1 

II 

! 

IT 

1 

(25.17) 

In Euclidean space 4 this becomes 



a e (®) = 

f d 4 p e~ ipx 

J (27r) 4 p 2 + m 2 

(25.18) 


In the last equation we’ve dropped the ie, since it is no longer required in this equation 
which has no poles (see also Fig. 25.1). This is the solution to the equation 

(-9| + m 2 ) A e (x - y) = S^\x - y). (25.19) 


That this is the case may be shown by transforming from Minkowski to Euclidean 
space. We make the changes 


a 2 -* -a 2 , 


A(x) -> A b (x). 

To see how the delta function transforms note that its definition is 

d4 p 


(25.20) 


S(x°)5^ (x) = J 


(2-rr) 4 J (2 t r) 

which may be substituted to give the desired result. 


i / x ^iS{r)5^ (a),(25.21) 
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25.2 The partition function 

We will now attempt to express the partition function as a path integral 
and will find that it involves the Euclidean generating functional. The 
partition function is given by Z = Tr e ^ l3H = 5Z A (A|e _/3ff |A). We 
know how to express a similar object as a path integral. In Chapter 23 
we found 

(qB\e- i 6 tB \q A ) = j V[q(t)\ e [ ^ dtL ^\ (25.22) 

which tells us that to evolve the state at A into that at B we add up 
all of the possible trajectories that start at A at t = 0 and end at B at 
t = te- Let’s cook up a similar description of X0 A (A|e _/3ff |A). 

First set |<7a) = |<Zb) = | A), sum over A and make the replacement 
Ib —» —i/3- This turns the left-hand side into the desired 
where the consequences of the sum will be discussed below. Now for the 
right-hand side. The replacement Ib —* —i/3 provides the integral in the 
exponent with the new limits ffZ 0 1/3 . To give this meaning we make the 
Wick rotation t —> — ir and we obtain 

53<A|e-^|A) = X) /^[g(^)]e _/ o /3dTiE[9(T)1 , (25.23) 

A A 

where L^qij)} = ^ (jf) + is the Euclidean version of the La- 
grangian for a single particle. 

The question is, how does the sum on the left affect the trajectories 
over which we’ll need to integrate? If we think of e~ l3H as an evolution 
operator, then we’re taking a state |A), evolving r from 0 to /3 and 
requiring that we end up at the same state. This means that we must 
have q{r = 0) = q(r = /3). But that’s not all. If we imagine r as 
extending from —oo to oo then the boundary conditions are periodic: 
after a period /3, the state returns to where it started. The sum on the 
left means that we sum over all configurations subject to these periodic 
boundary conditions. A quotable conclusion is that ‘statistical physics 
takes place in periodic, imaginary time’. 

This approach can be simply extended to fields. If H is the Hamil¬ 
tonian describing what’s happening to fields in three-dimensional space 
then we have a partition function given by the field integral 

Z = Tr \e~ 0k \ = [ £></> e“ -^o dT f d3 a£E [ <K*)], (25.24) 

-I J PBC 

where the periodic boundary conditions (illustrated in Fig. 25.2 and 
denoted ‘PBC’) are now </>(0, x) = (j>(f3,x) and £e[</>] is the Euclidean 
Lagrangian density. 5 Identifying drd 3 x with d 4 XE, we see that this is 
the Wick rotated version of the functional integral. 

We made the replacement xo —> —ir and so in momentum space we 
must make the replacement po iu. In fact, to guarantee the periodic 
boundary conditions are encoded in our functional integral we choose to 
work with the Fourier transforms of the fields. Since statistical physics 



Fig. 25.2 (a) The trajectory of a com¬ 
plex scalar field ^ in time t , where 
— oo < t < oo. (b) In statistical physics 
the field is defined only from imagi¬ 
nary time r = 0 to r = (3. For Bose 
fields like V>(t) or the real scalar field 
cf>(r) we impose the boundary condition 
(f)(0) = (j>(/3) to give a picture of periodic 
boundary conditions shown in (c). 

Note, however, that these boundary 
conditions are appropriate for Bose 
fields only. For Fermi fields we must 
impose the antiperiodic boundary con¬ 
dition that ^(0, x) = — t/>(/3, x ). 
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6 The rather awkward normalization is 
chosen to make the field f n (p) di¬ 
mensionless. This is done with mal¬ 
ice aforethought in order to guaran¬ 
tee that the expressions which will re¬ 
sult make sense in describing a statisti¬ 
cal system. An important expression 
required to manipulate these fields is 
f drd 3 o: e i “ nT e —ip :E = /3V<5 p ,o<5„„, 0 . 

This is another condition only appro¬ 
priate for Bose fields. To impose the 
antiperiodic boundary conditions for 
Fermi fields we require that the Mat- 
subara frequencies obey u n = 27r (^" t ~ 1 ) 

Takeo Matsubara (1921- ) 


involves systems in boxes and explicit factors of volume we will put the 
system in such a box of volume V which discretizes momentum space. 
The Fourier representation is then given by 6 

<l>{T,x) = \J^Y, e ~ iUnT+iP ' X ^(p), ( 25 . 25 ) 

n,p 

where the frequencies (quantized by the boundary conditions) must sat¬ 
isfy co n = with n an integer to ensure the periodicity. 7 The w n ’s are 
known as Matsubara frequencies and in our calculations the integral 
over imaginary time will become a sum over these frequencies. 

We may now state the rules of statistical field theory required to find 
the partition function. 

The rules of statistical field theory 

To find the partition function for scalar field theory follow the following 
recipe 

• Write down the Euclidean Lagrangian £ E = |( d T ()b) 2 + U[4>]. 

• The partition function is given by the functional integral of the 
Euclidean action, with periodic boundary conditions (PBCs) 

Z[J]= f Vcf) e -S&Td 3 x(C E [<t>]-J<t>), ( 25 . 26 ) 

J PBC 

where <f) is a function of r and x. 

• Impose the PBCs with the expansion 

<Kt,x) = ( 25 . 27 ) 

n,p 

where u> = 2irn/(3. 

• Note that we now use 

J dTd 3 xe i0J " T - ip x = PV8 p fi8u n ,o- ( 25 . 28 ) 


Example 25.4 

Let’s examine the familiar example of free scalar field theory in more detail. The 
Euclidean Lagrangian is given by 

£e = i (a T 0) 2 + ^ (V<« 2 + ^ 2 . (25.29) 

This leads to a partition function 


Z[J\ = 


L 


T>(f)e 


— f drd 3 x ^(d T 4>) 2 +^(V 4 >) 2 -\- </> 2 — J</> 


( 25 . 30 ) 
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We will calculate some thermodynamic properties of this theory. We therefore set J = 
0 and insert the Fourier expansion of the fields into the argument of the exponential. 
We obtain an argument of 

^[drd 3 x i Wn )(-ia- m )-H-i P ).(-i 9 )+m 2 ]^ n (p)e- i “^+ i P- 3! 

n,m,p,q 

= 2\7 F, <t > rn{q){-U>nU>m - p ■ q + m 2 )0„ (p)/3V<5 p , _ q 

n,m,p,q 

Q2 

= - — ^0»(-p)(w» +p 2 + m 2 )<j> n (p). 
n,p 

(25.31) 


This results in a partition function 

Z[J = 0]=f X>^ e - § TT. n pM-p){^ 2 n +p 2 +rn 2 )i n ( P ) (25.32) 

J PBC 

This is, of course, in the form of our favourite (and only) tractable functional integral. 
The solution (for J = 0) is that 

Z[J = 0] = Ndet [p 2 (w 2 +P 2 + m 2 )}~^ . (25.33) 

This is diagonal in p, allowing us to write In [Z(J = 0)] as the expression 
[d 2 (cj 2 + p 2 + m 2 )] + const. The sum over n can be done (using a couple 
of tricks described in the book by Kapusta and Gale) which allows the replacement 

In [/3 2 (uJn + E 2 )] = (3E P + 2 In ^1 — e~ f3Ep ^j + const., (25.34) 


where E p = \/p 2 + m 2 . To obtain the thermodynamic limit we make our usual 
replacement for summing over closely spaces levels in phase space, —> V f ’ 
leading to a result 


InZ = V 


/ 


d 3 p 

(27r) 3 



(25.35) 


We 


then have a Helmholtz energy F = ^ f [§/^p + In (l 


— e 


)]• 


7 r) a L 2 ^ 

Notice that the zero-point energy leads to a ground-state energy given by Eq = 
— -JfplnZ = f ( 2 tt)‘ 6 Ep [which is just like the from ( n + ^hw)\ and this is 
associated with a pressure Po = — • Our meaningful energies and pressures 

will be expressed relative to these vacuum values. Finally we obtain an energy, 
expressed relative to the (arbitrary) ground state energy of 


E - E 0 = V 


I 


d 3 p E p 

(27r) 3 e^ E p — 1 


(25.36) 


and a pressure 

p - p »-s/(S -M 1 -’-™’)- < 2537 > 

Although we have not learnt anything we couldn’t have extracted with simpler 
methods it is, as usual, heartening that the more sophisticated machinery pro¬ 
duces the same results. The power of statistical field theory comes in extract¬ 
ing predictions from unsolvable theories such as theory, to which we now turn. 


25.3 Perturbation theory and Feynman 
rules 

The joy of statistical field theory is that the rules we have already learnt 
which allow us to encode perturbation theory in terms of Feynman dia¬ 
grams carry over wholesale to statistical field theory. This is not magic; 
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8 See Exercise 25.2. 


we have made every effort to construct things this way! We found before 
that the generating functional could be expressed in terms of Feynman 
/ Connected source-to-source \ 

,. ^ r T i \ and vacuum diagrams / 0 . ,1 

diagrams as Z|J| = e ' ° / . bmce the re¬ 

sults of statistical field theory depend on In Z rather than Z we use the 

, , „ r Tl ^ / Connected source-to-source \ _ 
neat result that m Z \J = > I . .. . Remem- 

\ and vacuum diagrams J 

ber that the sources are included in order that we can extract Green’s 
functions. If we are just interested in the thermodynamic equilibrium 
properties we can turn off the sources and we have the key result 

In Z[J = 0] = (Connected vacuum diagrams). ( 25 . 38 ) 

In statistical physics we often find ourselves in a situation where the 
average value of the operator (<j>(x))% ^ 0. An example is the ordered 
magnet encountered in the next chapter. In that case, more useful in 
determining the fluctuations in the fields is the connected correlation 
function 

G c (x, y ) = ((j>(x)(j>(y))t - ($(x)) t {$(y)) t . ( 25 . 39 ) 


This may also be derived 8 directly from In Z too: 

G c (x,y) = {4>(x)$(y)) t - ($(x)) t ($(y )) t = d j^)dJ( y ) lnZ ^ 

E f All connected diagrams with two 
y external legs 

Since there are far fewer connected diagrams than disconnected dia¬ 
grams, this is a wonderfully labour-saving simplification. This also ex¬ 
plains why G c (x,y) is known as the connected correlation function. No¬ 
tice that this is the same result we had in eqn 22.15, where we insisted 
on normalizing the generating functional. 

We already know how to draw diagrams, so the question we must 
ask is how the calculations encoded by these diagrams differ from those 
we’ve seen before. Luckily, all of the Feynman rules and momentum 
space tricks we’ve built up can be employed with little modification. 
In the case of scalar field theory, the propagator in momentum space 
i/(pQ — p 2 — to 2 + ie) becomes the Euclidean space version 1 / (w 2 +p 2 + 
m 2 ). Integrals over all momenta will now involve a sum over Matsubara 
frequencies in place of the integral over po. The conversion rule is that 



d 4 p 

( 2 ^ 



(25.41) 


As long as we also have (2tt)S(p^ 1 — p^) —> (35 UJn ^ ulm then all is well. 

As an example, we now state the Feynman rules for translating dia¬ 
grams into thermodynamic quantities for cf) 4 statistical field theory: 
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The Feynman rules for cf> 4 theory for T± 0 

To calculate the amplitude of a diagram in statistical field theory 

• Each internal propagator line contributes - tt- n ■ 

LU Z + p- + m z 

• A factor —A results from each vertex. 

• Integrate over all unconstrained internal momenta with a mea- 
surel£ n /^. 

• Divide by the symmetry factor. 

• An overall energy-momentum conserving delta function 
(27 t) 3 (5 (3) (P in - p 0 ut)/^n,w m is understood for each diagram. 
For vacuum diagrams, this results in a factor /5(27 t) 3 <5 3 (0) —* /3V 
in the thermodynamic limit. 

For practice, we now evaluate a few diagrams in statistical field theory. 


Example 25.5 

The first-order correction to In Z[J = 0] is given by the double-bubble diagram shown 
in Fig. 25.3(a). The symmetry factor for this one is D = 8, so it makes a contribution 
to InZ of 


lnZ(i) = - 


XV 0 


iVl 


d 3 p 1 

(27r) 3 uj 9 + p 2 + in 2 


(25.42) 


The first-order correction to the propagator G(i) (&,<;) is the line-bubble shown in 
Fig. 25.3(b). The symmetry factor is D = 2 and the amplitude for this diagram is 
given by 9 


G ( p(M) = 


-A 

~Y 


??/ 


d 3 p 


(2t r) 3 Ji 


-(2ir) 3 S^(k- q)08 ul 


(25.43) 


Example 25.6 


We finish this chapter by examining some of the consequences of these corrections on 
the thermodynamics of the scalar field with </> 4 interactions. The loop integral and 
Matsubara sum in brackets in eqns 25.42 and 25.43 can be done. 10 In fact, it may 
be shown that it falls apart into two pieces: 


[ d*p 1 

f d 3 p 1 

^ 1 ^ 

/ (2tt) 4 (p0)2 + El J 

(2tt) 3 E p 1 

\ K e0 E P — 1 ) 


(25.44) 


The first term is independent of temperature and will not concern us here. The 
second term provides the first-order thermodynamic correction to In Z and to the 
particle energies. 

The resulting contribution to the temperature-dependent part of In Z is 

\0V \ r d 3 P 1 / 1 x 12 


In Z (1) = - 


[ d 3 p 1 / 

J (2 it ) 3 E p V< 


1 

,/3.Ep _ i 


00 



Fig. 25.3 (a) The double-bubble di¬ 

agram is the first-order correction to 
lnZ[J = 0] in </> 4 theory, (b) The line- 
bubble diagram is the first-order correc¬ 
tion to the propagator. 


^ Here the incoming line carries four- 
momentum q = (toi , q) and outgoing 
line four-momentum k = (cj m , k ). 


^See Kapusta and Gale, Chapter 3. 


(25.45) 
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11 See Exercise 25.3. This will give a contribution to the pressure that varies 11 as T 4 in the limit that 

m —> 0 . 

Turning to the line-bubble correction to the propagator: we note that in Chap¬ 
ter 16 it was suggested that the stuff that fits between the two external lines of the 
propagator, known as the self-energy, gives a correction to the particle dispersion 
E p . (In Chapter 33 we will see that this is true.) For our present purposes we claim 
that the correction to E p is provided by the part in curly brackets in eqn 25.43. In 
particular, it is straightforward to show that in the limit that m —> 0 the correction 
to E p varies as AT 2 . 


Chapter summary 

• Statistical field theory is obtained from quantum field theory by 
a Wick rotation, so that (temperature) -1 behaves like imaginary 
time. 

• The diagrammatic techniques developed for quantum fields can 
then be used to solve problems in statistical physics. 


Exercises 


(25.1) (a) What are the dimensions of the field in <j > 4 the¬ 
ory? Show that the normalization in eqn 25.25 
makes the held (j> n {p) dimensionless. 

(b) What normalization would be required for non- 
relativistic particles? What about Dirac particles? 

(c) Investigate the consequences of choosing a dif¬ 
ferent normalization for <)> 4 theory. 

(25.2) Verify that G c (x,y) may be obtained by differenti¬ 
ating In Z[J\ as suggested in the text. 

(25.3) Verify the T dependence of the corrections to <j > 4 
theory claimed in the chapter, in the limit that 
m —> 0 . 

(25.4) (a) After Wick rotating the Schrodinger equation 
and investigating the consequences, show that wave 
functions evolve according to ii>(r) = e~ HT il>(0) 
and the Heisenberg equation of motion becomes 

d -t=[H,A H ]. 

(b) For a system described by a Hamiltonian H = 
uic'c show that the operators evolve according to 
c(t) = e~^ T c and $(r) = e u ' T cb 

(25.5) Using a Wick rotation of / G(E)e~ lE( - tx ~ ty 1 = 

/ ie~ iEf - tx ~ ty) /{E - E p + ie), suggest a prop¬ 

agator for describing non-relativistic electrons at 
nonzero temperature. By convention the propaga¬ 


tor for T ^ 0 is defined as minus this quantity, 
preventing a proliferation of minus signs. 

Define the imaginary time free propagator for the 
quantum oscillator as 

G(t) = -{Tx(t)x( 0)) tl (25.46) 


where T is the imaginary time ordering symbol and 
the average is a thermal one. Vitally important here 
is that (cda)t = (h )t = {e 0w — l) -1 rather than zero! 
(a) Show that 

G(t) = - [0(r)((n) t + l)e - “ T + 6»(r)(h) t e“ T 

+ 8(-r)({h) t + l)e" T + e(-r)(h) t e - “ T ] . 

(25.47) 


(b) Using the definition 

G(w n ) = / d rG(T)e“'" T , (25.48) 

Jo 

where v n is a Matsubara frequency satisfying v n = 
, show that the propagator may be expressed as 


G(iu n ) 


1 

2 mu> 


1 

i v n - OJ 


1 

iv n + W 


1/m 

(hz „) 2 - lo 2 ' 
(25.49) 




Broken symmetry 



On the earth the broken arcs; in the heaven, a perfect round. 

Robert Browning (1812- 1889) 

In this chapter we turn to the profoundly important topic of broken 
symmetry. It’s hard to overestimate the influence this concept has had 
on condensed matter and particle physics and on quantum field theory 
in general. We will start by discussing the arena where these ideas were 
born: Lev Landau’s theory of the statistical physics of phase transitions. 
We then turn to classical field theory and will see how broken symmetry 
manifests itself in a system described by a Lagrangian. 

26.1 Landau theory 

Our discussion of Landau’s theory of phase transitions begins with a 
simple observation about magnets. We imagine a magnet whose spins 
can point either up or down. It’s well known that at high temperatures 
each spin is equally likely to be found up or down. The magnetization, 
that is, the spatial average of the magnetic moment, is zero. This system 
is shown in Fig. 26.1(a). This system has a symmetry: turn all of the 
spins through 180° [as in Fig. 26.1(b)] and the magnetization is still zero. 
Of course, each individual moment is pointing in the opposite direction, 
but the number pointing in the upwards direction is still half of the total 
and the magnetization is still zero. The symmetry here is a global one: 
we rotate all of the spins through the same angle, here 180°. 

It is found experimentally that upon cooling the systems through a 
critical temperature T c , the system undergoes a phase transition and 
the magnetization M becomes nonzero as all of the spins line up along a 
single direction, as shown in Fig. 26.2(a). The direction along which the 
spins align could either be all in the up direction or the down direction. 
If we rotate each spin of the aligned system through 180° [Fig. 26.2(b)] 
M is obviously reversed. We say that the system has broken (or lowered) 
its symmetry in the ordered phase. 

One puzzling feature of this story is the reason why the system chose 
to point all of the spins in the one direction rather than the other. 
After all, there’s nothing in the Hamiltonian which describes the system 
that distinguishes between up and down. The original symmetry of the 
system appears to have spontaneously broken. The same thing happens 
in the Euler strut shown in Fig. 26.3. A weight is balanced on top of 
an elastic strut. If the weight is large enough the strut will buckle. The 
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magnet at 
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> T c . The 


magnetization, or average moment, is 
zero in (a) and, after rotation of each 
spin through 180°, is still zero in (b). 
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the spins align along a single direction. 
The magnetization in (a) is different in 
(b), where each spin has been rotated 
by 180°. 
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1 Of course, in real life, some very small 
perturbation or fluctuation will have 
tipped the system towards its choice of 
ground state. 

2 Landau’s scheme is a form of mean- 
field theory , where the magnetization is 
described by a uniform field M. We 
will meet mean field theories through¬ 
out this book, most notably in Chap¬ 
ter 43. 



Fig. 26.3 The Euler strut, (a) This 
has a vertical axis of symmetry and can 
buckle either (b) one way or (c) the 
other, in each case breaking symmetry. 


(a) F (b) F 



Fig. 26.4 The Landau free energy for 
(a) T > T c with a minimum at M = 
0 and (b) for T < T c with minima at 
±M 0 . 


o 

This means that any slight perturba¬ 
tion of the system which has been pre¬ 
pared in a M = 0 state will propel it 
into one, or other, of the new minima 
that have emerged at ±Mq. 


' lliis classification is due to Philip 
Anderson and described in detail in 
his classic Basic Notions of Condensed 
Matter Physics. 


buckling can be either to the left or to the right. There is nothing in 
the underlying physics of the strut and weight that allows one to predict 
which way it will go. 1 The result is that the ground state does not have 
the symmetry of the Hamiltonian describing the system. For the magnet 
this Hamiltonian could be H = —J JT SfSf + 1 , where the up direction 
is along 2 . This has no bias either favouring the spins pointing up or 
favouring them pointing down. 

Lev Landau had a scheme for thinking about phase transitions involv¬ 
ing a breaking of symmetry. 2 Since equilibrium in thermodynamic sys¬ 
tems relies on both minimizing the internal energy U and maximizing the 
entropy S' of a system, Landau considered the free energy F = U — TS. 
To find the equilibrium state of the system we need to minimize F. The 
free energy is a function of an order parameter, namely some field 
describing the system whose thermal average is zero in the T > T c un¬ 
broken symmetry state and nonzero in the T < T c broken symmetry 
state. For a magnet, the order parameter is simply the magnetization 
field M. With joyful and deliberate ignorance of any microscopic de¬ 
scription, Landau wrote F(M) as a power series 

F = F 0 + aM 2 + bM 4 + ..., (26.1) 

where a and b are parameters which are independent of M, but may in 
principle depend on the temperature. This free energy has the symmetry 
M —> —M, that is, reversing the magnetization doesn’t affect the en¬ 
ergy. If a is positive we have the free energy shown in Fig. 26.4(a), which 
is minimized at M = 0, which is the correct prediction for the high- 
temperature regime. If, however the parameter a is negative then we 
have the free energy shown in Fig. 26.4(b), which is known, by some, as 
Lifshitz’s buttocks. This has two minima at nonzero values of magnetiza¬ 
tion. These minima correspond to the spins all aligning up (M = +Mq) 
or all aligning down (M = —Mo). The previous minimum M = 0 is now 
at a position of metastable equilibrium. If we take a = ao(T —T c ) (with 
ao a constant) and b to be T-independent, then clearly a is positive for 
T >T C and negative for T < T c , and so we predict a phase transition at 
a temperatures T = T C . 


Example 26.1 

We can find the minima of F straightforwardly for T < T c . We set 

dF o 

- = 2 aM + 4b M 3 = 0, (26.2) 

8 M K J 

from which we conclude that the minima occur at 

M o = -f b ■ (26-3) 


For T < T c we have two minima at M = ±Mq. The system will choose 
one of these as its new ground state and the symmetry will be broken. 
When a symmetry is broken in a many-particle system there are four 
features 4 that one should look out for. 
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• Phase transitions We saw that in Landau’s example, the pa¬ 
rameter a in the free energy was temperature dependent. At a 
temperature T c , at which a changes sign, a phase transition takes 
place. The transition separates two distinct states of different sym¬ 
metry. The low-temperature phase has lost some symmetry, more 
precisely it is missing a symmetry element. 5 

• New excitations Our philosophy has been that every particle is 
an excitation of the vacuum of a system. When a symmetry is bro¬ 
ken we end up with a new vacuum (e.g. a vacuum with M = —Mo). 
The fact that the vacuum is different means that the particle spec¬ 
trum should be expected to be different to that of the unbroken 
symmetry state (such as M = 0 in our example). We will see 
that new particles known as Goldstone modes can emerge upon 
symmetry breaking. 6 

• Rigidity Any attempt to deform the held in the broken symmetry 
state results in new forces emerging. Examples of rigidity include 
phase stiffness in superconductors, spin stiffness in magnets and 
the mechanical strength of crystalline solids. 

• Defects These result from the fact that the symmetry may be 
broken in different ways in different regions of the system, and are 
topological in nature. An example is a domain wall in a ferromag- 
net. These are described in Chapter 29. 


5 

In fact, you don’t need to regard the 
parameters in a model as being depen¬ 
dent on anything: in the Lagrangian 
examples below we won’t, and then the 
phase transition is not such a useful 
concept. However, the link between 
phase transitions and symmetry break¬ 
ing is a very powerful one that uni¬ 
fies our understanding of an enormous 
range of phenomena in Nature. 

^See Section 26.3. 


26.2 Breaking symmetry with a 
Lagrangian 

Let us now apply the arguments developed for magnets to our La¬ 
grangian treatment of (classical) held theory. 7 Here instead of follow¬ 
ing the ground state magnetization, we’re interested in the ground state 
value of <j)(x) or (returning briefly to quantum mechanics) the expecta¬ 
tion value of the held operator (0| 0(ar) |n), which is the order parameter 
in held theory. We start with a scalar held theory 

C= -U(<j>), (26.4) 

where we’ve split off all of the potential energy terms and called them 8 
U(<f>). In the familiar case of (/> 4 theory we have that 

U(4 ’) = y 4 2 + ^ 4 - (26.5) 

This function resembles the free energy in our magnet example above. 
It admits the global symmetry (j>(x) —* —<p( x )- Assuming \i 2 is positive 
we have a potential with a minimum at <f> = 0 as shown in Fig. 26.5(a). 
The minimum corresponds to a quantum mechanical ground state, also 
known as the vacuum, of (fl|0(a;)|fi) = 0. As we’ve seen earlier, the 
excitations of this held are phions with mass m = /r. 


Classical field theory will be enough 
to show us the main features of broken 
symmetry. See S. Coleman and Peskin 
and Schroeder for more details on the 
quantum mechanics of broken symme¬ 
try. 


^More strictly U(4>) is the potential en¬ 
ergy density. It should not be confused 
with the internal energy, which is also 
conventionally called U . 


(a) U(<f>) 



(b) U{<j>) 



Fig. 26.5 Equation 26.5 for (a) ji 2 > 0 
with a minimum at (f> = 0 and (b) for 
fi 2 < 0 with minima at ±-\/6/z 2 /A. 
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9 From dU/d(j) = 0 we find 


0 — —p. <j> + — <j> , 

from whic h we de duce stationary points 
at (0, d=y/6/x 2 /A). The second deriva¬ 
tive, 

d 2 u _ 2 \<t > 2 

is negative (and equals — y?) for <j> = 0 
and is positive (and equals +2/x 2 ) for 
the minima at <j>$ = zb-y/6^x 2 /A. 


Now we turn to a very interesting possibility: what if we swap the 
sign in front of /r 2 ? In that case we have U{<f>) = — ^-</> 2 + and 

again have a potential that looks like Fig. 26.5(b). The minima 9 of the 
potential now occur at 


■to = *(?£)*, (26.6) 

so we have the choice of two new vacua. The system will spontaneously 
choose one and the symmetry <j )o —* — 4>o of the ground state is broken. 

Next we ask if the new vacuum has the same excitations (a.k.a. par¬ 
ticles) as those of the unbroken symmetry state. 


Example 26.2 

To do this we choose a vacuum (for definiteness let’s choose +<j>o) and we expand the 
field around this new minimum. The Taylor expansion gives us 

U(<t>-<t> o) = + (|~Q (0-0o) + ^j(^f) (</>- </>o) 2 + ■ ■ • 

= t/(0o) + /x 2 (0 - 0 O ) 2 -f- . .., (26.7) 

and since U(</> o) is an ignorable constant, we can write the Lagrangian in terms of 
<j>' = (f) — (f> o as 

c = ^(dtfr') 2 — ( 26 . 8 ) 


10 For the ordinary scalar field La¬ 
grangian, the factor multiplying the 
term quadratic in field is m 2 / 2, with 
m the mass of the particle excitations 
in the theory. 


11 In the quantum mechanical version 
we say that the ground state does not 
have the symmetry of the Hamiltonian. 


By comparison with our original scalar field theory, 10 we find from 
eqn 26.8 that the mass of the phion excitations in the broken symmetry 
state is now no longer m = /i but has become m = s/2n- 

An important point to note is that the Lagrangian does not ‘break 
symmetry’ itself. Indeed our Lagrangian, just like the Landau free en¬ 
ergy for the magnet, is invariant with respect to the transformation 
<f> —2 —(f>. The breaking of symmetry is a property of the ground state 
(or vacuum) of the system. 11 

To conclude, the effect of spontaneous symmetry breaking is (i) for </> 
or (again, briefly returning to quantum mechanics) the vacuum expec¬ 
tation value to acquire a non-vanishing, constant amplitude 

/r 2 \ 1/2 

(j) o, in this case = ( -j- ) ; (ii) the particle excitations of the theory 

are the same as before, but now have a mass 


26.3 Breaking a continuous symmetry: 
Goldstone modes 

The scalar field theory had a discrete global symmetry (<j) —> —</>). More 
dramatic and interesting still is the effect of symmetry breaking for a 
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theory with a continuous global symmetry. A two-component field with 
the sign of its mass term flipped looks like 

£ = 2 [(^m^i ) 2 + (^m^z) 2 ] + Tp (<Ai + <d) ~ 41 (^i + ^ 2 ) • (26.9) 

This has a global S l O(2) symmetry: it is symmetric with respect to 
rotations in the (internal) 4 >i(x)-(f> 2 {x) plane (where the same rotation 
takes place at all points in spacetime). The potential t/(<^> 1 , ^> 2 ) for this 
theory is sketched in Fig. 26.6(b) and looks a lot like the bottom of a 
wine bottle [as shown in Fig. 26.6(a)]. There are an infinite number of 
potential minima, and these are found to lie on a circle [Fig. 15.4(c)] 
whose equation can easily be found 12 to be </> 2 + <j >2 = ~x~- 

Let’s break the symmetry by supposing the system chooses a par¬ 
ticular vacuum. Each is as good as any other, so for computational 

simplicity, let’s choose (</>i,</f> 2 ) = and expand around this 

broken symmetry ground state using variables (f>\ = <j>i — \J and 
(j)' 2 = <^ 2 - This is illustrated in Fig. 26.7. 


Example 26.3 

With the potential t/( 0 i,</> 2 ) given by 

U(</> 1 , <t> 2 ) = ( 0 i + <A!) + -j-y ( 0 i + ^!) 2 > (26.10) 

Taylor expanding around the minimum <j> 2 ) = ( \J - 0 ) gives d 2 U/d(j>\ = 2 /.t 2 

and d 2 U/d<j>\ = 0 and so we obtain (ignoring constant terms and truncating the 
expansion at order <f > 2 ) 

£ = - [(90j ) 2 4 - [d^f] — £t 2 (</> i ) 2 + 0 «/ 3 ). (26.11) 


Equation 26.11 shows that the particles of the 4>i held now have mass 
m = \f2fi, but that there is no quadratic term for the <f>' 2 term: thus the 
particles of the <p 2 field are completely massless! This seemingly magical 
result is less surprising if we look again at the potential landscape shown 
in Fig. 26.7. We can see immediately that excitations in the <j>i direction 
will cost energy since such excitations have to climb up the walls of the 
potential. However, a small movement in the <f >2 direction corresponds to 
rolling round the gutter. Such motion has no opposing force as it involves 
no change in potential energy and therefore costs no energy, exactly what 
we would expect for a massless excitation. In other words, we still have 
particles in the <f >2 field, it’s just that their dispersion starts at the origin, 
rather than from some offset determined by the mass. In the language 
of condensed matter physics one would say that the excitations in the 
<f >2 field are gapless, rather than massless, but it amounts to the same 
thing. 


12 

To quickly show this, you can write 
U (#) = —tL-x- b^y # 2 where x = (f>\+(}> 
and then dU/dx = 0 yields the result. 



Fig. 26.6 (a) The potential for the 

50(2) symmetry breaking looks like 
the bottom of a punted wine bottle, 
(b) There is a maximum at the point 
(pl = <f>2 — 0 , but surrounding this 
there is a set of minima which lie on 
a circle, (c) The circle of minima are 
shown on a (pi-(p 2 plot (this is therefore 
viewing the surface sketched in (b) from 
‘above’). The symmetry can then be 
broken by choosing a particular point 
in the circle of minima and setting this 
to be the ground state. We can then ex¬ 
amine small deviations away from that 
point. 




Fig. 26.7 Breaking symmetry at 
the position (<pi,<p 2 ) = 


There are two possible excitations: the 
particle can oscillate up and down in 
the (pi direction (a massive excitation) 
or trundle round in the gutter (a mass¬ 
less excitation). 
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Jeffrey Goldstone (1933- ). The 
bosons had previously been discovered 
by Yoichiro Nambu (1921 ) in the con¬ 
text of superconductivity. 


13 

See chapter 12, although note that 
there we were dealing with a non- 
relativistic theory and wrote \I/(rr) = 
yj p(x)e ld ( x ^. Here, in dealing with 
the relativistic theory we write ijj(x) = 
g(x)e lG ( x ^ and so f d^x\g(x)\ 2 [rather 
than f d^x p(x)] gives the number den¬ 
sity of particles. 


14 Philip Anderson (1923—), Francois 
Englert (1932- ), Robert Brout (1928— 
2011), Peter Higgs (1929- ), Tom Kib¬ 
ble (1932- ), Gerald Guralnik (1936- ) 
and Carl Hagan (1937- ). Higgs and 
Englert shared the 2013 Nobel Prize 
in physics for their discovery (Englert 
and Brout worked together, but Brout 
died before the prize was awarded). 
Anderson (who won the Nobel prize 
in 1977 for other work) had a non¬ 
relativist ic version in 1962 which was 
inspired by superconductivity, and in 
1964 all the others (Brout and En¬ 
glert published first, closely followed by 
Higgs) produced relativistic treatments 
of what we now call the Higgs mecha¬ 
nism (though only Higgs mentioned the 
massive boson). 


This vanishing of the mass is a manifestation of Goldstone’s 
theorem, which says that breaking a continuous symmetry always 
results in a massless excitation, known as a Goldstone mode. The 
massless particle associated with this excitation is known as a 

Goldstone boson. 


Example 26.4 


Perhaps more illuminating is the complex scalar field theory. In this theory we’ll 
break global U( 1) symmetry. Of course U( 1) is isomorphic to SO( 2), so we won’t 
have anything too different from the previous example. The U( 1) theory, with a 
positive mass term, reads 

C = + AW- (26.12) 

This theory enjoys a global U( 1) symmetry: a transformation ip —> 'ipe 16 * has no effect 
on the Lagrangian. We switch to ‘p°l ar coordinates’ 13 via ^(x) = g{x)e and this 
theory reads 

C = (d^B? + Q 2 (<V) 2 + mV - V, (26.13) 

where the two fields are now called g(x) and 0(x). The symmetry transformation for 
this version is £>—>£> and 6 —> 6 + a. 

Let’s break this symmetry. The potential part is U = —p?g 2 + A^ 4 whose minima 
are on a circle of radius g = yj^ . We set go = \JP 2 /2A and (arbitrarily, but with 
convenience in mind) choose a minimum by selecting 6q = 0, then expand around 
the minimum with g' = g — go and O' = 6 — Oo, to obtain 

£ = (fx) (fl'-field terms) 

+(<9 M e') 2 “ 2 mV 2 - 4 (Y) 2 Q 13 ~ V 4 (e'-field terms) (26.14) 

j 5 £)'l ( d/j.9 ') 2 + .. . (interaction terms), 


where constant terms have been dropped and terms have been arranged according 
to the contributions of the two individual fields and then the interactions between 
them. As might be expected, there is no term in O ' 2 and so the 0'-field is massless. 
As before, this is because it costs nothing to roll in the gutter. The mass of the 
^/-field has been shifted tom= 


26.4 Breaking a symmetry in a gauge 
theory 

An extraordinary feature that emerges upon breaking a symmetry in a 
gauge theory was spotted by Anderson, Higgs, Kibble, Guralnik, En¬ 
glert and Brout and is famously known as the Higgs mechanism. 14 As 
we’ve said previously (see Chapter 14), a Lagrangian which has a local 
symmetry contains gauge fields. The simplest example is the gauged 
complex scalar field theory, whose Lagrangian (with a flipped signed for 
the mass term) is given by 

C = (dV f - i qA^)(d^ + i qA^) + - A^V) 2 - 

(26.15) 
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which is symmetric under the local transformation —> tpe la ^ as long 
as we also transform A p —> A p — ^d p a(x). This Lagrangian describes a 
world made up of two sorts of oppositely charged, massive scalar particles 
with energies E p = ( p 2 + ^t 2 )? and two sorts of transversely polarized 
photons with energies E p = |p|, which are, of course, massless. 

Now we'll see what happens when we break symmetry in a system 
with a local symmetry. Working in polars, the ground state has the 
held tp(x) = g(x)e lS ( x ^ and takes on a unique phase angle 9(x) = 9 o for 
all x. We are no longer permitted to change the phase of the ground 
state at different values of x (local symmetry) or indeed change the 
phase of the ground state for the entire system (global symmetry). The 
broken symmetry ground state therefore breaks global symmetry and, 
as a consequence, local symmetry too. 

Next we turn our attention to the excitations. Remember that the 
Lagrangian is gauge invariant so we are able to make gauge transforma¬ 
tions to simplify the physics as much as possible. In the next example 
we show that the gauge transformation we make to elucidate the par¬ 
ticle spectrum of the broken symmetry system reveals something quite 
exciting. 


Example 26.5 

In polar coordinates, the second bracketed term in eqn 26.15 can be written 

d P i> + iqA p ip = (d^e)e l£f + i{d p 0 + gA M )ge 10 . (26.16) 


Thus we notice that A^ enters the theory as 

A M + -d^O = C,x, (26.17) 

Q 

and so we will replace A M by C^, thereby simplifying the term —iqA^'ip^)(d^,'ip-\- 

i qA^ip) = (d^g) 2 + g 2 q 2 C^C^. The replacement is a gauge invariant one since 
F^lv = d^A u — d v A M = d^Cu — d u C^i. The Lagrangian in eqn 26.15 can now be 
written in terms of g and fields as 

c = (d^e ) 2 + Q 2 q 2 C 2 + mV - A £> 4 - ^Fl* v F pv , (26.18) 


where we use the shorthand C 2 = C^C^. 

Now t o br eak the symmetry! The minima of the potential are ag ain on the circle 
j u 2 / „2 

at g = yj We choose the symmetry to be broken with Qo = y 2 \ an< ^ $0 — 0. 

The expansion that reveals the excitations above the new ground state is most easily 
done by expanding in terms of a field y, defined as -A= = g — go- Ignoring constant 

i • v2 

terms we obtain 


C 


\( d rX) 2 - n 2 X 2 - '/Xyx 3 ~ jX* 

-\f^f^ + ^-c 2 


y 


l 


+«r V xC" + -q x C +■■■■ 


(26.19) 


where M = 
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1 5 Recall from Chapter 13 that the La- 
grangian for a massive vector field is 


C = F^F, 




1 The notion of a pure gauge will ap¬ 
pear again in our treatment of topolog¬ 
ical effects in Chapter 29. 


Sidney Coleman (1937-2007) 
N. David Mermin (1935- ) 
Herbert Wagner (1935- ) 


As usual, we have the contribution that derives from the radial field (here 
called x), which has a mass However, there is a big surprise here 

in the second line of eqn 26.19: the theory also now contains a massive 
vector field C p , whose particles have mass M. But what of the 9 field? 
It has completely disappeared! The excitations of the 9 field, which were 
massless in the global symmetry breaking version, have disappeared and 
have been replaced by those of a massive vector field 15 C p (x). As Sidney 
Coleman put it, it’s as if the massless photon field A p (x) has eaten the 
Goldstone bosons from the 9 held and grown massive, changing its name 
to C p (x). The theory now describes one sort of scalar particle, which is 
an excitation in the x( x ) field with energy E p = ( p 2 + 2^ 2 )s, and three 
sorts of massive vector particle, which are excitations in the C p (x) held 


m 

that thi 


with energies E p = |£C 

One might now worry that the loss of the Goldstone bosons means 
that we have lost some of the precious degrees of freedom in the mathe¬ 
matical description of the helds. This worry is unfounded because upon 
symmetry breaking we have the change 


I 2 x massive scalar particles \ / 1 x massive scalar particles \ 

^ 2 x massless photon particles J ( 3 x massive vector particles J ’ 

(26.20) 

that is, four types of particle are excitable before and after we break 
symmetry. 

So where did the Goldstone boson go? The answer becomes clear when 
we reexamine what we did to remove it: we made the change of variables 
A p + ^d p 9 = C p , which is a gauge transformation! If the Goldstone 
particle can be removed with a simple gauge transformation then it 
can’t have been there in the first place. In other words, it must be pure 
gauge. 16 Another way of describing the Higgs mechanism, therefore, is 
the removal by gauge transformation of all Goldstone modes. 

This feature of symmetry breaking, where the Goldstone mode may 
be removed and turned massive through combination with a gauge field, 
has become one of the most important in modern physics. Historically 
it told particle physicists, puzzled by that apparent lack of Goldstone 
particles in Nature, that symmetry breaking allows another way. We 
will return to this undeniably important topic in Chapter 46. 


26.5 Order in reduced dimensions 


17 

We use the theory of a complex scalar 
field ip(x) since there a continuous U( 1) 
internal symmetry is broken, leading to 
the emergence of a Goldstone mode. 
The presence of the Goldstone mode is 
crucial here. 

^ Strictly we should write the integra¬ 
tion measure as f , but we will 

abbreviate it here to save on clutter. 


Field theory allows us to ask questions about systems which have less (or 
more!) than the ordinary three spatial dimensions of our world. One fa¬ 
mous result is the Coleman—Mermin—Wagner theorem, which tells 
us that symmetry breaking is impossible in two or fewer spatial dimen¬ 
sions in systems with an order parameter with a continuous symmetry. 

We’ll show that this is the case for complex scalar field theory with 
spontaneous symmetry breaking. 1 ' The idea of the proof is to ask what 
the fluctuations look like at one point in space. We get this by evaluating 
the Euclidean (two-point) correlation function at the origin. We want 18 
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G(0,0) = (|V>(0)| 2 ) = |/zw( 0)| 2 e- SE ^ 

= lim ^ / Vij) ipfx)^ (0)e _,SE ^. (26.21) 
x—>0 Z J 

We can evaluate this since we know that the propagator is given in 
d-dimensional Euclidean space by 

i r r Ad^ p—i p-(x-y) 

G{x,y) = {i>(x)ft{y)) = - V-ipip(x)ip*(y)e~ SlLW = / — y- 

Zj J J ( 27 r) a p z + m z 

(26.22) 

If the system breaks symmetry then the propagating particles will be 
Goldstone particles. These are massless, so give rise to a fluctuation 
with mn = 0 and result in a correlation function 

= fwr4' <m23) 

whose integrand contains a singularity at p = 0. For d > 2 the singular¬ 
ity is integrable and the fluctuations in the field are well behaved. For 
d < 2 the singularity is not integrable and the fluctuations diverge. The 
divergent influence of Goldstone bosons breaks up the order in the sys¬ 
tem. We conclude that symmetry breaking is impossible in our complex 
scalar field theory for d < 2. 

Chapter summary 

• In many phase transitions, symmetry is broken below T c and the 
ground state possesses only a subset of the symmetry of the Hamil¬ 
tonian. 

• Breaking a continuous symmetry results in a massless excitation, 
known as a Goldstone mode or Goldstone boson. These are not 
stable in two or fewer dimensions and symmetry breaking is then 
not possible (Coleman-Mermin-Wagner theorem). 

• In a gauge theory, the Goldstone mode can be removed and be¬ 
comes massive through combination with a gauge field (the Higgs 
mechanism). 


A simple example of the Coleman- 
Mermin-Wagner theorem can be found 
in the behaviour of a ferromagnet. 
Magnons (quantized spin waves) are 
bosons and cost energy E p = ap 2 (in 
the low p limit) where a > 0 is a con¬ 
stant. Thus the number of magnons ex¬ 
cited at temperature T is proportional 
to 

f djpj jpl^" 1 

J e PE P — 1 

The integrand is proportional to \p\ d ~^ 
at low momentum (assuming T / 0), 
and therefore diverges when d < 2. 
Thus long-range order in a ferromag¬ 
net is wiped out by spin waves at all 
nonzero temperatures in two or fewer 
dimensions. This argument uses no 
quantum field theory, and holds only 
in this specific case (the analogous ar¬ 
gument is more elaborate for antifer- 
romagnets, see the book by Auerbach) 
but it is apparent that the mathemat¬ 
ical origin is the same as that used for 
G(0, 0) in eqn 26.23, namely that an in¬ 
tegral of this form diverges when d < 2. 


Exercises 

(26.1) In this problem we prove that if the vacuum is in- energy) will emerge. 

variant under an internal symmetry then a multi- Consider creation operators for two types of parti- 

plet structure (that is, a different field with the same 
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cles and with the property 

[Qn, 4>i] =4>i, (26.24) 

for some generator of a symmetry group, such that 
[On, H] = 0. We impose the condition 

On|0)= 0. (26.25) 

(a) Show that e iof<5N |0) = |0). 

(b) Show also that 

E a = E b . (26.26) 

where H$ A |0> = E A f> jjO) and TL^O) = £ B <^|0). 
Note that symmetry breaking occurs in the con¬ 
trasting case that Qn|0) ^ 0. 

(26.2) The Fabri-Picasso theorem demonstrates that if a 
Lagrangian is invariant under an internal symmetry 
with charge operator On = / d 3 * J°, then there are 
two possibilities: either 

(i) (as above) we have Qn|0) = 0; or 

(ii) 0n|0) has an infinite norm. 

Here we prove this fact. 

(a) By using the translation operator e lp a show that 

<0 |j£(x) 0 n| 0) = <0 |j£( 0)0 n! 0). (26.27) 

(b) By considering the matrix element (0|QnQn|0) 
along with the result of (a), show that either 
0n|0> = 0 or 0n|0) has an infinite norm. 

Note that if Qn|0) has an infinite norm, then it does 
not exist in the same space as |0). This is the case 
in spontaneous symmetry breakdown. In this case, 
the state Qn|0) is not zero, but another possible 
vacuum state. See Aitchison and Hey, Chapter 17 
for more details. 

(26.3) We'll work through a famous proof of Goldstone’s 
theorem. This can be found in many books. Start 
with a theory with a continuous symmetry with 
charge Qn, where Qn|0) ^ 0. Consider a field ip(y ) 
which is not invariant under Q n such that 

[Qn,<K</)] =$(l/), (26.28) 

where ip is some other field. We are going to exam¬ 
ine (0|'i/>(0)|0), which we assume takes on a nonzero 
value when symmetry is broken. 

(a) Show that 

JL(0${0)\0) = - J dS ■ (0| [j n (x),^(0)] 0), 

(26.29) 

where Jn is the space-like part of the Noether cur¬ 
rent. Hint: You should assume = 0. 

Since the commutator in the previous equation 
contains local operators potentially separated by 


a large space-like interval, we conclude that 
<O|-0(O)|O> is independent of time. 

(b) Insert a resolution of the identity to show 

/'d 3 a:^[(O|j5(O)|n>(n|0(O)|O)e- ij ’^ 

n 

- (O|0(O)|n)<n|j£(O)|O>e ip "' ir ] ^ 0. (26.30) 

(c) Evaluate the spatial integral to verify 

^(2tt)V 3) (p„) [(0|J£(0)|n)(n|^(0)|0) e lp ° n X ° 

- (0|Vi(0)|n)(n|j£(0)|0)e- ip "' a:O ] ^ 0. (26.31) 

From (a) we conclude that this expression is inde¬ 
pendent of x°. 

(d) Argue that for |n) = 10), eqn. 26.31 vanishes. 

(e) Argue that if |n) describes a massive particle 
state then matrix element (n| j£|0) must vanish. 

(f) Argue that if we want (n| j£|0) ^ 0 then we re¬ 
quire p° —► 0 as p n —> 0. 

(g) This proves Goldstone’s theorem: the states 
linked to the ground state via the Noether current 
are massless Goldstone modes. Convince yourself 
that this is so! Again, see Aitchison and Hey for 
more details. 

(26.4) Consider another gauge theory: complex scalar 
electrodynamics with a symmetry breaking mass 
term, whose Lagrangian is 

C = -\e^F pv + |£>„^| 2 - V(iP), (26.32) 

where V(ip) = —m 2 {ip^ip) + ^(ip^ip) 2 and D M = 
d M + i gA M . 

(a) Take the broken symmetry ground state to be 

ip 0 = [j 5 . Expand about this using ip = 

ipo + + 102 ) and show that the potential of 

the broken symmetry theory is 

U(<P i,60 = m 4 + m 2 f>i + 0«> 3 ), (26.33) 

exactly as in the case without electromagnetism. 

(b) Consider the kinetic energy term. Show that, 
on symmetry breaking, this becomes 

ID^ipl 2 = ^(d^i) 2 + (26.34) 

+V2qip 0 A l _ l d p <p 2 + q 2 iplA tl A p + ... 

Interpret this result in terms of massive photons. 
This example shows that the massive gauge parti¬ 
cle in a broken symmetry system may be found by 
expanding the covariant derivative. 



Coherent states 



’Tis all in pieces, all coherence gone 
John Donne (1572- 1631) 

There exists a class of problems in which a macroscopic number of 
quanta all pile into the same momentum state. Examples include the 
laser, where we put a large number of photons in the same wave-vector 
state into a cavity, and the superfluid, where a macroscopic number of 
Bose particles sit in a zero-momentum state. The best way to describe 
such systems uses a set of basis states discovered by Schrodinger and 
christened coherent states by Roy Glauber. A coherent state is an 
eigenstate of an annihilation operator. In real space coherent states are 
Gaussian wavepackets. As the occupation of the coherent state becomes 
large, the wavepacket becomes more and more narrow, increasingly re¬ 
sembling an object with classical properties. In this chapter we examine 
the properties of these states. Having built up quantum field theory from 
an analogy between identical, non-interacting particles and harmonic os¬ 
cillators we start with the coherent states of the harmonic oscillator. 


27.1 Coherent states of the har¬ 
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Roy Glauber (1925- ) 


27.1 Coherent states of the harmonic 
oscillator 

The harmonic oscillator has a Hamiltonian H = to (a^a + -)). Recall 
from Chapter 2 that this was obtained from the Hamiltonian 1 H = 
ui(P 2 + Q 2 ) by the introduction of creation and annihilation operators 
a and a) . This produced energy eigenstates | n) containing n quanta 
and with energy eigenvalue (n + |)w. Thus (n|(P 2 + Q 2 )\n) = n + 
1 , but P and Q are individually not good quantum numbers. 2 For a 
classical harmonic oscillator, the position and momentum behave like 
Q = Qo cos(uit — i p) and P = — P 0 sin(wt — (p), so that both quantities 
oscillate but there is a definite 90° phase relationship between them. 3 A 
quantum mechanical state that possesses this phase relationship is what 
we are after, and so we are led to consider eigenstates of the operator 
combination Q + iP in which this phase relationship is hard wired. Since 
a = (Q + iP)/y/2, we thus seek eigenstates of a. We therefore define a 
coherent state |a) by 

a\a) = a\a), ( 27 . 1 ) 

where a is the annihilation operator and a is the eigenvalue (which will 


1 In this chapter we use reduced vari¬ 
ables P 2 = 2 )“j an d Q 2 = \mujx 2 , 

and hence 

Q=75(“ + “ t )> P = ~75(“- “*)• 

2 In fact ( Q } = (P) = 0 for energy 
eigenstates. 

o 

This feature is not captured in our 
quantum mechanical treatment of the 
harmonic oscillator in Chapter 2, since 
energy eigenstates |n) are, by defini¬ 
tion, stationary states and individually 
do not possess any signature of any¬ 
thing oscillating at angular frequency 
a;. Coherent states will achieve this 
by being combinations of energy eigen¬ 
states. Coherent combination will give 
rise to interference, and this will recover 
wavelike behaviour. 
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4 There is no restriction on a being real 
since a is manifestly not Hermitian. 


5 In eqn 27.2 we have used \n) = 

^t=-|0 ) in the second line. 

Vn! 


P n 



Fig. 27.1 The Poisson distribution 
P n = e - l Q! l 2 |a| 2n /n! = e~^(n) n /n\ 
plotted for |a| 2 = (n) = 4. Note that 
a coherent state consists of a sum of 
harmonic oscillator states: the quantity 
P n represents the probability of finding 
this coherent state in the nth harmonic 
oscillator state. 

5 In fact this whole example can be done 
very quickly if you use a property of 
a Poisson distribution which is that its 
mean and variance are the same. In this 
case, the mean (n) is |a| 2 , and hence so 
is the variance, meaning that An, the 
square root of the variance, is simply 


be a complex number). 4 Of course, a coherent state will be a sum of many 
energy eigenstates and so we may write \a) = c nW)- Substituting 

this into eqn 27.1 leads to the recursion relation c n +i = ac n /y/n + 1, 
and hence 5 

'“> = c "( | 0 >+ ^t |1> + S | 2 >+ 3 S |3>+ ^) 

= CO (l + + ^(afi 2 + |V ) 3 + ...) 0), (27.2) 

and normalization (a\a) = 1 fixes Co = e - l“l / 2 . Thus we can write the 
coherent state compactly as 

|a) =e^e aat |0). (27.3) 

|„|2 - 

The amplitude to find |a) in the state |n) is c„ = (n|a) = e 2 a n /vn!, 
corresponding to a probability density P n = \c n \ 2 = e“l Q l \a\ 2n /n\ and 
this represents a Poisson distribution (see Fig. 27.1). In the next exam¬ 
ple, we establish some basic properties of this probability distribution of 
coherent states. 


Example 27.1 

We don’t know the number of quanta in a coherent state but we can work out the 
average number 6 

(h) = (a|ala|a:) = \oi\ 2 . (27.4) 

We can calculate the uncertainty An in the number of quanta in a coherent state 
from An = (n 2 ) — (n) 2 . Thus by working out 

(n 2 ) = (a|a^aa^a|a) = |a:| 4 + |a:| 2 , (27.5) 


we find that the uncertainty An is 


An = |a|, 


(27.6) 


and therefore the fractional uncertainty is 

An |a| 

W) = W 


i 

M' 


(27.7) 


We conclude that although we don’t know exactly how many quanta are in a coherent 
state, the fractional uncertainty in the number tends to zero as a (and hence the 
average occupation) tends to infinity. 


Example 27.2 

In this example we see how coherent states respond to an operator U(9) = e —10n , 
where 6 is a real number and n is our usual number operator. Let’s practise first on 
the state | n) (where n|n) = n|n)). This is easily done when we realize that U{6) can 
be written as a power series and hence 


(l + (-i On) + 


(—i eh) 2 





urn*) 


2 ! 


(27.8) 
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since any power of the operator h on the state |n) just gives that power of the number 
n. This now allows us to work out 


um«) 



n =0 


(. ae~ w ) n 
\frt\ 


|n) 


|ae- ie >. 


(27.9) 


Thus the action of U(6) on the coherent state |a) is to turn it into another coherent 
state with an eigenvalue multiplied by e -1 ®. 


This last result tells us that the time-dependence of the coherent state 
| a) under the Hamiltonian H = (n + |)w is given by 7 

| a(t)) = e~ i6t \a(0)) = e- iuJt/2 \a(0)e~ iwt }. (27.10) 

The exponential prefactor e~ lult / 2 is a simple phase factor originating 
from the zero-point energy, but the main message should be read from 
the state |a(0)e _lwt ): coherent states remain coherent over time and the 
eigenvalue a executes circular motion around the Argancl diagram at an 
angular speed to. It looks as if we have recovered the coherent oscillatory 
behaviour we were after. But is there a way to visualize these states? 


7 

Equation 27.10 should be contrasted 
with the analogous result for energy 
eigenstates of the harmonic oscillator. 
Equation 27.8 gives 

|n(t)> = e _i(7l+ 5 )u,t |n(0)>, 

so in this case the only time- 
dependence in the state is found in the 
uninteresting phase factor. 


27.2 What do coherent states look like? 

To see what a state looks like as a function of some coordinate Q we 
project it along Q to form a wave function, via the amplitude ( Q\a ). 


Example 27.3 

Let |Q) and |P) be the eigenstates of Q and P. We write a|oj = -^= ( Q + i p) |o) = 
a\a) and act on the right with a state (Q| to obtain 

-^<Q|(Q + iP)|a>=a(Q|«>. (27.11) 

Since Q\Q) = Q\Q) and P = —i^^, we can make the replacements (Q\Q = Q(Q\ 
(since position is real) and (Q\iP = from which we obtain the differential 

equation for the wave function: 

??(Q|a> = - (Q - V2a) <Q|a>. (27.12) 

This equation has the normalized solution 

(Q\a) = J r e-(<5-^2“) 2 /2 ! (27.13) 

7T 4 

so the wave function is a Gaussian. This one is displaced by \/2a = (Q) + i(P). You 
can also show that the wave function in momentum space is given by 

(P|ct) = A r c-(P+iV 2 a ) 2 /2 (27.14) 

7V 4 

Using the position and momentum wave functions we can work out the uncertainty 
in position and momentum. This is most easily done via the definitions 

(AQ) 2 = (Q 2 ) - (Q> 2 , (AP) 2 = (P 2 > - (P) 2 . 


(27.15) 
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P 

* 



Fig. 27.2 A coherent state can be pic¬ 
tured as a ‘blob’ orbiting the origin in a 
plot of P against Q. Its mean position 
is given by (Q) = v2Reo; and (P) = 
v^Ima, where a(t) = a(0)e _ia;t . It 
is a blob, rather than a point, because 
there is uncertainty in both Q and P, 
given by AQ = AP = (see Exer¬ 
cise 27.3). 


We find (Exercise 27.3) that 

(AP) 2 (AQ) 2 = I. (27.16) 

Comparing this with the uncertainty principle AQAP >1/2 shows that the coherent 
states have the minimum possible value of uncertainty. This is the sense in which 
coherent states are the closest quantum mechanics allows us to come to classical 
objects which are localized in position and momentum space. 


Finally we ask what these states do as a function of time. We have 
shown that the state |a(0)) evolves to |a(i)) (apart from an uninteresting 
phase factor) where a(t) = a(0)e _lwt . An immediate consequence of 
this is that the expectation value of the position and momentum of 
our state may be calculated. Since we have (a) = (a|a|a) = a and 
(d'f) = (a|at|a) = a*, we also have 

<Q) = ^=- and (P)=- (27.17) 

and we find that (writing a(0) = «oe lv ) 

(Q) = V2ao cos (cot — ip) and (P) = — V2ao sin(o;t — ip), (27.18) 

meaning that the expectation values execute simple harmonic motion 
(see Fig. 27.2). These results are exactly as we expected for the classical 
harmonic oscillator at the beginning of the chapter. 

27.3 Number, phase and the phase 
operator 

Because a is a complex number, we can write a = |a|e lS and obtain 

| 0 >. 

>7.19) 

If we differentiate with respect to the phase 6 we find that 

_i J^l a ) (27.20) 

This implies that just as we have conjugate variables momentum and 
position linked via p = —igy, we identify conjugate variables n and 8. 
However, a closer examination of the phases of states shows that some 
care is needed in their interpretation. The definition of an operator that 
measures the phase of a system turns out to be rather problematical. 
One possibility suggested by Dirac is to make a phase operator via 

a = 

a f = v^e^f (27.21) 

These already look rather odd in that it’s unclear how to interpret the 
square root of the number operator. We’ll examine the consequences of 
these definitions. 


\a) = e 


= e 2 


1 + 


1 ! 


|a| 2 e“ iy (dT ) 2 |a| d e die (dT)' :i 
2! + 3! 
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Example 27.4 

The exponential operators can be rearranged to read 

e 1 ^ = a(n) - 2, 

e -i ^ = (n)“ia t . (27.22) 

Since we know that a) a = h, then we must also have 

e -i4>j4> = fi-ihh~i = 1. (27.23) 

However, if we try to find e 1<J ^e — ^ we encounter a problem since 

e i<£ e -i<£ = (n)- 1 ^ + 1) ^ 1, (27.24) 


which means that the exponential operator e 1 ^ is not unitary. Since these exponential 
operators are not unitary, the phase operator <j) cannot be Hermitian 8 and so cannot 
be used to describe an observable. There’s another problem. The phase operators 
yield a commutation relation (exercise) 


|n,e i( ^J = -e^, (27.25) 

which implies 9 that we have 

\h, <j>] = i. (27.26) 

This, in turn, gives us a number-phase uncertainty relation 

AnAcj> > 1. (27.27) 

This is a problem if taken literally, since if we take An to be small we must have A<j> 
greater than 2-7T. Dirac was so troubled by these inconsistencies that he left these 
operators out of the third edition of his seminal textbook The Principles of Quantum 
Mechanics. 


® Remember that U = e l H . where U is a 
unitary operator and H is a Hermitian 
operator. 


9 If [A, B] = c then [A,e A ®] = A ce x ^. 
We take A = h, B = <f> and A = i. It 
follows that c = i. 


One possible way forward was suggested Iny Peter Carruthers in the 
1960s. 10 Here we regard the exponentials e ±I< ^ as a whole as the fun¬ 
damental operators. For things to work out we need the exponentiated 
phase operators to obey the relations 

e l <t>\n) = \n — 1), 

e^l n) = \n + 1). (27.28) 


We may then form Hermitian operators from the exponentials via the 
definitions 


cos (j) = ^ (e 1 ^ + e -i ^) , 

sin (f) = — (e 1 ^ — e _1<?i ) . (27.29) 


Peter Carruthers (1935-1997) is per- 
haps best known for his leadership of 
the theoretical division at Los Alamos 
National Laboratory from 1973-1980, 
overseeing a notable period of expan¬ 
sion. 

1 Actually by three students in his 
class. His problem set contained the 
following: Apparently no one has in¬ 
vestigated whether quantities <j> and y/h 
1 defined’ by equations (1) really exist. 
A bonus will be given for an answer to 
this question. The bonus turned out 
to be a bottle of beer. See M.M. Ni¬ 
eto, arXiv:hep-th/9304036vl for more 
details. 


These operators, which do not commute with each other, have the prop¬ 
erties 


h, cos cj) 
n , sin 6 


= ism </>, 
= —icos 6 


(27.30) 
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and uncertainty relations 

An A cos 4> > 

An A sin <fr > 

Carruthers’ prescription then gives us a meaningful phase operator to 
play with, as long as we agree only to ask questions about sines and 
cosines of the phase. In the next section we examine some examples of 
coherent states. 

27.4 Examples of coherent states 


^ | (sin <j)) |, 

^IW)|. (27.31) 


11 It is possible to show that 
sin Ov. 

A cos d>u = -. 

2|a fc | 

A proof is given in the book by Loudon. 


Example 27.5 

A box containing an electromagnetic field (or in the jargon, a cavity containing 
photons) allows various normal modes. If we put a fixed number of photons into the 
box and have them all in a single mode labelled by a wave vector k , then they will 
be in an eigenstate of fife, which we will write as |rife). Since fife|rife) = rife|rife), we 
know precisely the number of photons (i.e. we know rife) and we have absolutely no 
uncertainty in this number (Arife = 0). However, we have complete uncertainty in 
the phase of this state. 

If instead we put photons, still all with the same wave vector /c, into a coherent 
state |afe), we are now uncertain about how many photons we have (Arife = |afe|), 
by the uncertainty in phase 11 Acos^fe —> 0 as \otk\ oo. As we put more photons 
into this state ((fife) = |c*fc| 2 ) then |afe| grows and the state becomes more phase 
coherent. This situation is a good model for what occurs inside a laser. 


12 Superfluids are described in Chap¬ 
ter 42 and superconductors in Chap¬ 
ter 44. 


Example 27.6 


Another important use of the coherent states is in describing the matter fields of 
superfluids and superconductors. 12 These phenomena are examined in detail a little 
later, but here we introduce a method of describing superfluids in terms of coherent 
states. Like the laser, the superfluid also contains a large number of quanta in the 
same quantum state. The particles are interacting bosons and are described by a 
coherent state 


W = \ a P 0 a Pi-)- 


(27.32) 


The coherent state |i/>) is an eigenstate of the field annihilation operator ^(a?) given 

by 


'£'(*) 


1 


E a pe ip 


(27.33) 


so that 

'l'(cc)IV’) = i>{n)\i>). 


(27.34) 


The eigenvalue ip(x) can be written simply as 




1 

Tv 




(27.35) 
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and is often called a macroscopic wave function, for reasons which will become 
apparent. 13 

In the superfluid a macroscopic number of particles reside in the p — 0 state. We 
then have ijj{x) = ao/W, a simple complex number which we can write as y/no e 10 °. 
Thus the expected number density of bosons is |o;o| 2 /V = no and since no is very 
large (we have macroscopically occupied the coherent state), the uncertainty in the 
phase is vanishingly small. 

We conclude that the macroscopic wave function ^o{x) for a state where the p 
state is coherently occupied is best written 

4>o(x) = x/noe l8 °, (27.36) 

where 6q is a constant field. We can therefore think of superfluid order as a sponta¬ 
neous symmetry breaking of phase symmetry, resulting in the phase becoming fixed 
to the same value everywhere. This is examined in more detail in Chapter 42. 


Chapter summary 


13 

Note that the expectation value 
(V>|'E'(a;)|V>) = ip(x) = Ep Q P eip a! . 
which is significant since for 
occupation-number states (with 
fixed numbers of particles) we would 
expect 

(n p n P2 ...\^(x)\n Pl n P2 ...) 

= Ep(”Pi«P 2 -l“pl n Pi Tl P2-) eiP ' 3! 

= o, 

since {n p . \a p . \n p .) = 0 for all i. So 
clearly the interactions between bosons, 
which lead to a coherent state |i/>), are 
doing something very special. In fact, it 
will transpire that x ) is the order pa¬ 
rameter for the ordered, coherent state 
of bosons, known as a superfluid and 
discussed in Chapter 42. 


• A coherent state is an eigenstate of the annihilation operator with 
eigenvalue a, a complex number. 

• An occupation-number state contains a fixed number of particles 
but the phase is not well defined. A coherent state does not contain 
a fixed number of particles, but that number is Poisson distributed 
with mean |a| 2 . Its phase however can be well-defined, and its 
uncertainty decreases as |a| increases. 

• Coherent states can be used to describe lasers and superfluids. 


Exercises 


(27.1) Verify that 

He I Q, I 2 I Q I 2 

(a|/3) = e“ p (27.37) 

Compare this expression to the analogous one 
for energy eigenstates of the harmonic oscillator, 
(m\n) = Smn■ Why is there a difference? Use these 
results to show that (n\a\n) = 0, but (a|a|a) = a. 
Again, why the difference? 


(c) Using the previous results, show that the uncer¬ 
tainty relation for coherent states is indeed 

(AP) 2 (AQ) 2 = I. (27.39) 

(27.4) Verify that the momentum space coherent state is 
given by 

(P\a) = _L e - (p + i ^“ )2 . (27.40) 

7T 4 


(27.2) 


(27.3) 


Show that 


(Q) + i(-P) 

V2 ' 

(a) Show that (Q) = \/2Rea, (P) 
(Q 2 ) = | + 2(Re a) 2 and (P 2 ) = i + 

(b) Show that AQ = A P = -t*. 


(27.38) 

= \/2Iina, 
2(Ima) 2 . 


(27.5) Show that 

(Q-<Q>)|a>=-i(P-(P>)|a>. (27.41) 

One way of stating this result in words is that, for 
coherent states, the uncertainty is balanced between 
position and momentum. 




254 


Coherent states 


(27.6) 

(27.7) 


Show that 

[h,e i<? ] = -H. (27.42) 

Once again consider the forced quantum oscillator 

L = ^mx(t) 2 — ^mu> 2 x(t) 2 + f(t)x(t), (27.43) (27.8) 

where /(f) is the force which you should assume 
acts only for a finite length of time: 0 < t < T. 

The evolution of ( x(t )} will be given by the sum 
of the time dependence in the absence of the force 
added to the response: 

x(t) = x 0 (t) + J dt'x(t — t')f(t'). (27.44) 


(a) Show that x{t) is given by 


x(t) = 


( 1 


\ 2 mco 


)'(■ 


„ —ia )t . +•1 cvt 

ae +a e 


) 


(27.45) 


+9(t — t') f dt' 1 ^^ fe _i 
J 2muj V 


— ic o(t—t) ^ic o(t — t 


'>)• 


and find the energy imparted to the system by the 
source. 

(f) Find an expression for \a) in terms of the un¬ 
forced ground state |0). 

Consider (yet again) the forced quantum oscillator 
from the previous question with the forcing part ex¬ 
pressed as an interaction. 

(a) Show that 


-i [ dftfi(t)|0> = -^4r|l). (27.49) 

J (2 mu>) 2 

(b) By considering the S-matrix, show that the am¬ 
plitude for the source to create a state containing a 
single quantum is given by 


i/H „( Dumbbell) 
1 ® 

(2mw) 2 


(27.50) 


Hint: Use the definition of the susceptibility in Ex¬ 
ercise 21.2. 

(b) For times t > T verify that this may be written 


x(t) = 


2mu> 


a + 


(2 mu>) 


(2 raw) 

I e 1 


r/H) e_iwt 

.(27.46) 


(c) Argue that the Hamiltonian must take the form 


where (Dumbbell) denotes the Feynman diagram in 
Fig. 22.2. 

(c) Extend the previous result to show that the am¬ 
plitude for the source emitting n quanta is given by 

An =^=(JMA e (Dumbbell) (27 51) 

\fn\ y (2 mu>) 2 J 

(d) Show that the probability of the source emitting 
n quanta is given by 


H = oj 


a f - 


(2 mw’ 


t/(-w) U + 


( 2 muj) 2 


r/H 


(27.47) 

(d) Show that this Hamiltonian is diagonalized by 
the coherent state |a) which is the eigenstate of the 
a operator and find a. 

(e) Show that the number of quanta emitted by the 
source is 

I/HI 2 


(n> = 


(27.9) 


(27.48) 


P n = — L e -H 2 , (27.52) 

n\ 

where |a| = -L£C0J^. Comparing with the previous 

( 2 mu>) 2 

question, explain why this result was inevitable. 

A coherent state is an eigenstate of the annihila¬ 
tion operator a. Why do we never bother with the 
eigenstates of the creation operator a* 2 


2 mu 



Grassmann numbers: 
coherent states and the 
path integral for fermions 



Thus far, our discussion of fields and coherent states has concentrated 
exclusively on bosons. We would also like to describe fermions with path 
integrals and coherent states. This turns out to be a non-trivial matter . 1 

Consider the path integral, for example. In calculating Z[J] for Bose 
fields we have argued that we integrate over the classical action in the 
presence of sources Z[J\ = f T><j)e~ 1 ^ A X ( c +J<l > )_ Unlike the canonical 
formalism, the path integral formulation doesn’t deal with operators 
and their commutation relations; just classical fields. (The quantum be¬ 
haviour comes about through the constructive interference of the classi¬ 
cal paths.) This formulation works for Bose fields because the ordinary 
numbers outputted by the functions over which we integrate commute , 
just like the quantum operators of the canonical formalism. 

As we discussed in Chapter 3 the operator-valued fields that represent 
fermions in the canonical approach anticommute. This is the property 
that, for example 

(28.1) 
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^For this reason, Chapter 28 « 

can be 

skipped on a first reading. 

Grass- 

mann numbers will only be used 
tion 38.2. 

in Sec- 


where ip( x ) is an operator that annihilates a fermion at spacetime point 
x. In order to be able to describe fermions with a path integral we need 
a new sort of number: a number that anticommutes. 

Even before the advent of quantum mechanics, the properties of such 
numbers had already been examined by Hermann Grassmann. Although Hermann Grassmann (1809-1877) 
Grassmann’s work was largely neglected during most of his life, the 
anticommuting numbers he invented are now known as Grassmann 
numbers. In this chapter we will examine the properties of Grassmann 
numbers. This will allow us to write down a coherent state for fermions 
and will allow us to write a path integral for fermions. The latter will 
be useful in our description of the quantum field of the electron. 


28.1 Grassmann numbers 

Grassmann numbers anticommute. This means that two Grassmann 
numbers 77 and £ have the property that 


vC = -<»?• 


(28.2) 
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Immediately following from this is that ?y 2 = 7777 = —ijr] = 0. In words: 
the square of a Grassmann number is zero. This simplifies the algebra 
of Grassmann numbers. Because rj 2 = 0 we can’t have any terms in the 
Taylor expansion of a function with more than a single power of 77 , and 
so the most general function of rj is then given by 


2 

Real numbers commute with Grass¬ 
mann numbers. 


f{v) = a + bri, (28.3) 

where a and b are real numbers . 2 

We’d like to be able to do calculus with Grassmann numbers. We 
define the act of differentiating via 

= 1, £ja = 0. (28.4) 

Note also that this derivative operator itself anticommutes, so if £ is 
another Grassmann number we have = — = — C- 


Example 28.1 

Differentiating the most general function of a Grassmann variable, we have 

9f(v) = b 
dg 

The most general function of two Grassmann variables is written 

g(g, C) = a + 677 + cC + eg£. 

We can differentiate this to show that 

= b + e£, ^g{v, C) = c - eg. 


(28.5) 

(28.6) 
(28.7) 


We would also like to integrate Grassmann numbers. We define integra¬ 
tion over Grassmann variables via 


/ d77 77 = 1, / d?? = 0. 


(28.8) 


(Note again that the integration operator d ?7 is to be regarded as a 
Grassmann number.) As a result of the definitions, integration of our 
most general function /(??), given in eqn 28.3, yields 


J d 77 /( 77 )= J d? 7 a + j drjbii = b. (28.9) 


Notice that this is the same result that we obtained by differentiating. 
In fact it is a general result that integration and differentiation of Grass¬ 
mann numbers yield exactly the same result! 


Example 28.2 

Let’s check this assertion by integrating the function g(rj, £). Integrating with respect 
to 77 , we have 

J drj{a + br) + < + er jQ = b + e£, (28.10) 

while integrating with respect to £ gives us 

|dC(a + 6 , + cC + erjQ) = c — eg, (28.11) 

which is identical to what we had in eqn 28.7. 
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28.2 Coherent states for fermions 


One use of Grassmann numbers is in defining a coherent state that de¬ 
scribed fermions. Taking the creation operator for a fermion to be c 1 we 
define a fermionic coherent state as \rj) = e~ vc |0), where 77, as usual, is 
a Grassmann number. We can immediately simplify this using 77 2 = 0 
and write 

|r7) = |0) - r7|l). (28.12) 


In order for things to work out, we re¬ 
quire that Grassmann numbers like 77 
anticommute with fermion operators. 
For example, we require { 77 , c} = 0. 


Example 28.3 

We will check that a coherent state defined this way is the eigenstate of the fermion 
annihilation operator c. We have 


c| 77 ) = c|0) — C 771 1) = 0 + rjc\l) = 77 IO). 


But 


v\v) = »?|0) - )? 2 |1> = r/|0), 

and so c|);) = r/\v) as required. 


(28.13) 

(28.14) 


We can also define a state {r]\$ = (r]\ri , where 

(f}\ = (0| - (1| fj= (0| +r?(l|. 


(28.15) 


Note that 77 is not the complex conjugate of 77 and (77 1 is not the adjoint of 

\rf). With these definitions it follows 3 that the value of an inner product 3 See Exercise 28.1. 

(Cl??) = e c \ (28.16) 

Finally, for these coherent states to be useful, we require that they form 
a complete set via the completeness relation 


dyydry e 7 ? 77 1 77) (77 1 = 1 . 


(28.17) 


28.3 The path integral for fermions 

We start by evaluating the Gaussian integral for coherent states 4 

J drjdfje f ' ari = J diydiy (1 + 77077) = J d77(177 = a. (28.18) 

This carries over to the case of Grassmann-valued (TV-component) vec¬ 
tors 17 = (771,772, 7?iv) and 77 = (771,772,ipv) and we find 5 that for a 

matrix A: 

f d N r]d N fj e fjAl7 = det A, (28.19) 

where d N r)d N fj = dyidfjidr]2dfj2---dr]NdfjN. 

Finally, we can work out the most important integral of all. Starting 
with Grassmann numbers if and 77, we will want to integrate an equation 
of the form 

I diM^e^’+^+’K (28.20) 


1 T 1 ad these been ordinary c-numbers, 
we would have 


/ 


dzdz e 


— z az 


= —, (Re a > 0), 
a 


where dzdz* represents the indepen¬ 
dent integration over real and imagi¬ 
nary parts of z. While the factor of 7 r 
is neither here nor there, the important 
point to note is that the a comes out in 
the numerator for the Grassmann ver¬ 
sion. 

^See Exercise 28.3. 
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6 Recall that we originally 

solved the jmth integral problem 

I = f dxe 2 ~ + bx by completing the 
square, writing 



allowing us to conclude that I = 



We play the usual trick of completing the square 6 and we find 

$ Kip + fj'ip + ijjr] = (^ + 7 /iW 1 ) K (if + K~ x 77 ) — fjK _ 1 ? 7 , (28.21) 

which enables us to conclude that 

J di(>d$ = (28.22) 

where the number C (which will be proportional to a determinant for 
the vector case) will later be removed by normalization. (Remember 
that most of our problems only require part of the solution: the part in 
the exponential is important and the multiplying prefactor far less so.) 

We can immediately extend this result to the functional integral we 
need. Our goal is to find the generating functional for fermions 



'Dip’D^ e' 7 d M 


(28.23) 


7 r> 

Recall that to normalize we define 

Z[i ?,)?] = Z[ff,rj\/Z[ 0 , 0 ]. 


If necessary we massage the Lagrangian into the form 

4>(x)Kif>(x ), with K a differential operator, allowing us to read off the 
answer that the normalized 7 generating functional is given by: 

Z[f},rj\ = e -^)K~\*,y)v(y) (28.24) 


As before, we can then read off the propagator as i K 1 and the quanti¬ 
zation process is complete! 


Chapter summary 

• Grassmann numbers anticommute and can be used to describe 
fermions. 

• The fermion coherent state can be written 1 77 ) = |0) — 77 ( 1 ). Such 
states can be used to construct the fermion path integral. 


Exercises 


(28.1) 

(28.2) 

(28.3) 


From the definitions of the coherent states 1 77 ) and (28.4) 
(C|, prove that (C|»?) = e^ 17 . 

Find (a) f df/dr/r/fj and (b) f djydjyipp 
Show that f d 2 r/d 2 f/e f ' Arl = detA, where rj = 

, V = ( fji f )2 ), d 2 ? 7 d 2 f; = dr/idfjidr]2dfi2, 

and A is a 2 x 2 matrix. 


Prove that 

J dridfje^lfj}^] = | 0 )< 0 | + | 1 )( 1 |. (28.25) 

and may therefore be taken as a resolution of the 
identity. 

(28.5) Verify eqn 28.21. 




Part VII 


Topological ideas 

In many cases in quantum field theory we are concerned with working 
out some integral over spacetime coordinates and therefore the precise 
geometry of spacetime is important. However, there are some problems 
where the problem is entirely insensitive to the shape of spacetime, that 
is it has no way of sensing the spacetime metric. In this case, the theory 
will give an answer which is some kind of invariant which nevertheless 
does depend on some global property of the topology of the spacetime. 

• In Chapter 29 we introduce some simple topological ideas and use 
them to describe simple topological objects which can exist in field 
theories. These include kinks and vortices. 

• Topological field theories are discussed in Chapter 30 and we pro¬ 
vide some concrete examples of applications of these ideas by dis¬ 
cussing a nyons, fractional statistics and Chern-Simons theories. 
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1 Newton-John’s grandfather was Max 
Born (1882-1970), one of the found¬ 
ing fathers of quantum mechanics. The 
quoted lyric is by John Farrar. 



Fig. 29.1 A cup can be continuously 
deformed into a doughnut (a torus). 


You better shape up 

Olivia Newton-John (1948- ), Grease 1 

In this chapter we meet a new class of excitation in the field known 
as a topological object. These are configurations of the field which 
may exist in systems that show spontaneous symmetry breaking. As 
discussed in Chapter 26, when a symmetry is broken we should watch 
out for objects known as defects that reflect the possibility that a field 
can break symmetry in different ways in different regions of space. The 
objects we will encounter in this chapter are examples of such defects. 
We will discuss the two simplest topological objects: kinks and vortices. 
Both of these may be thought of as time-independent lumps of energy 
which are held together by interactions. An important additional feature 
of vortices is that they’re not stable alone and we must add gauge fields 
in order for them to be realized. As always, the presence of gauge fields 
provides the theory with some particularly interesting properties. We 
begin with a crash course on basic topology. 


2 

This is why projections of the Earth’s 
surface on maps usually involve a cut 
through one of the major oceans. 



Fig. 29.2 A product space is con¬ 
structed by attaching a copy of one 
space to every single point in another 
space, (a) A product space of the real 
line (R) with itself produces the two- 
dimensional plane R 2 . (b) The product 
of two circles (S' 1 ) produces the torus 
T 2 . A cup can be continuously de¬ 
formed into a doughnut (a torus). 


29.1 What is topology? 

The doughnut and the coffee mug are topologically equivalent because 
one can be continuously deformed into the other (Fig. 29.1). We imagine 
that everything is made of some kind of mouldable clay and so all objects 
can be pressed and prodded so that their shape changes. However, 
you are not allowed to puncture or heal, to add or remove holes, so 
that neighbouring points in the object remain neighbouring after the 
deformation. Topological spaces are given mathematical names: the 
real line is denoted R, so the plane is M 2 and the n-dimensional version 
is R”. One can define a product space in an obvious way, and so 
R x R = R 2 [Fig. 29.2(a)]. Similarly R x R x • • • x R = R". A segment 
of a R joined head to tail gives a circle, denoted S' 1 . The sphere is 
S 2 . (Note that this is a two-dimensional space, because by ‘sphere’ 
mathematicians mean the surface of a ball, not its interior.) The sphere 
cannot be embedded in R 2 (you can’t fully represent a sphere on a piece 
of paper without cutting it 2 ) but can be embedded in R 3 . (Similarly S 1 
can’t be embedded in R.) The torus T 2 (back to the doughnut again) 
can be constructed using the product T 2 = S 1 x S 1 [Fig. 29.2(b)]. 

Once we have a topological space, we can define a path through the 
space as a mapping f(x) of the real-line segment [a, b ] (shorthand for 
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a < x < b) to the topological space. If /(a) = f(b), the path becomes a 
loop. Paths can be continuously deformed into each other in an obvious 
way and one finds that in a particular topological space the set of possible 
loops can be divided up into a number of classes and these can be mapped 
onto a group (called the fundamental group and given the symbol 7Ti). 
For example, in R™ all loops are contractible (i.e. can be continuously 
deformed to a point) and so there is only one class of loop and is a 
trivial group just consisting of the identity element. It’s more interesting 
if the space has a hole in it, so that the loops can be divided into classes, 
each one characterized by the number of times the loop winds round the 
hole. This integer is known as the winding number. Thus 7Ti = Z, the 
set of integers. Further examples are 7Ti(S' 1 ) = Z and 7 Ti(T 2 ) = ZxZ 
(see Fig. 29.3). 

The crucial point about topological arguments is that they do not rely 
on any notion of the geometrical structure of space. Topological spaces 
are continuously deformable and therefore a topological description re¬ 
flects the underlying ‘knottiness’ of the problem. 


Example 29.1 

A thin solenoid (radius R , aligned along e z ) inserted into the two-slit experiment 
(see Fig. 29.4) gives rise to a magnetic vector potential outside the solenoid equal to 
A 9 = BR 2 /2r and inside A d = Br/2. The magnetic field B = V X A is then equal 
to zero outside and (0,0,5) inside the solenoid. 

Even though B = 0 outside the solenoid, the electronic wave function can still 
be affected. This is because a plane wave ip oc e ip ' r / h and in the presence of A the 
momentum p —> p — q A and so over a trajectory ip acquires an extra phase factor 
e 1 ^ where Aa = — ^ f A dr. This might not seem to matter because if you perform 
a gauge transformation A —> A + V% then Aa = — ^ f * dr and so Aa can be 
anything you want, all you need to do is to pick a particular form for However, 
when we consider the two-slit experiment, we are interested in the phase difference 
A 5 between the two paths which is given by 

AS = Aa± — Aa 2 = — — / A • dr + — / A • dr 

fa Ja—*l—>b fa Ja—>2—*b 

= |^Adr, (29.1) 

where the final integral is carried out anticlockwise around the trajectories. However, 
we may use Stokes’ theorem to write 

AS = - f V x A ■ dS = - f B ■ dS = -<S>, (29.2) 

hj hj h 

where 4> is the flux through the solenoid. This leads to a shift in the interference 
pattern on the screen and is observed experimentally. Gauge transformations make 
no difference here because § Vy • dr = 0. Remarkably we expect a nonzero gauge- 
independent phase difference A(5 even though electrons only pass through regions 
without magnetic field. The effect is known as the Aharonov—Bohm effect and 
is topological. The electronic wave function is defined on the plane R 2 minus the 
origin (where we place the flux), i.e. a sheet with a hole in it. Electromagnetism has 
U( 1) symmetry, which has the same topology as S 1 (the phase can be defined on a 
circle in the Argand diagram). Thus to define a phase everywhere on a sheet with 
a hole in it amounts to mapping S 1 on to a path around the hole. These fall into 
disjoint classes (labelled by an integer winding number, because ni(S 1 ) = Z) and 
these cannot be deformed into each other. 



Fig. 29.3 Loops on a surface of a 
torus T 2 can be contractible (contin¬ 
uously deformable to a point), such as 
C, or non-contractible such as A and B 
(each of these winds round once). Any 
loop can be described using two wind¬ 
ing numbers, the number of times the 
loop winds round like A and the num¬ 
ber of times like B. In topological par¬ 
lance: 7 ti(T 2 ) = Z x Z. 



Fig. 29.4 The Aharonov-Bohm effect. 
Electrons diffract through two slits but 
their paths enclose a solenoid. The 
field inside the solenoid is 5, but the 
electrons only pass through regions in 
which there is no magnetic field. 


Yakir Aharonov (1932- ) 
David Bohm (1917-1992) 
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Fig. 29.5 The double well potential. 


<t>( X ) 

A 


J 

1 (j)(x) — V 






Fig. 29.6 The uniform ground state of 
the field in a system with broken sym¬ 
metry. 


0 ) 



Fig. 29.7 The kink solution. 


Q 

Notice that the kink has energy in¬ 
versely proportional to the coupling 
constant. This is significant since it 
implies that an expansion in powers 
of the coupling constant, of the sort 
we make in perturbation theory, would 
never predict such a field configuration. 
This is one of the hallmarks of topo¬ 
logical objects: they are fundamentally 
non- p ert ur bat i ve. 


29.2 Kinks 

Consider the Lagrangian C = |(<9 At 0) 2 — U{<j>) in (1 + l)-dimensional 
spacetime, with U((f>) = j(v 2 — </> 2 ) 2 . This is just our symmetry breaking 
Lagrangian for </> 4 theory as can be quickly seen by expanding the bracket 
and setting v 2 = The potential, shown in Fig. 29.5, has two minima 
with U(</>) = 0 at <j>(x) = ±u. The vacuum of the theory is to be found 
at <f>(x) = v or 4>(x) = —v. We know that one way to work out what sort 
of particles live in this potential is to expand about one of the vacua. 
The excitations are then predicted to be particles of mass m = (2Xv 2 )^. 
The point of this chapter is to tell you that objects other than particles 
can live in the potential too. These are stationary configurations of the 
field <j>{x) whose energy density \{di<f>) 2 + [/(</>) goes to zero at x = ±oo, 
but which do something non-trivial in between. 

Let’s examine a field which has zero energy density at x = ±oo. Such 
a field must be stationary and occupy a zero of the potential at ±oo. A 
boring way of achieving this is for (j>(x) = v as shown in Fig. 29.6. The 
configuration (j>(x) = —v is similar. This is just symmetry breaking: 
the system falls into a ground state of <j> = v or <j> = —v. However, 
there is a more interesting configuration of the field. What if the field 
takes (f>(— oo) = —v and </>(oo) = i>? Then we must have something 
like the object shown in Fig. 29.7. This configuration is known as a 
kink and involves the curious state of affairs of half of the field living in 
one vacuum and half in the other. Notice that the field has broken the 
symmetry in two different ways. At x —> oo the field takes on the value 
<f> = v, whereas for x —> — oo it takes on value </> = —v. We might say 
that the fields at ±oo live in difference vacua. To find out whether the 
kink is stable, we may evaluate its total energy. If the energy is finite 
then we’re in luck as such configurations are allowed to exist. 


Example 29.2 

We will evaluate the energy of a kink. We take the kink to be stationary in time, so 
doj> = 0. We want to work out E = f dx [I(9j <j>) 2 + U(o)] . From the equations of 

motion we have that ^, which may be integrated to give § = H (</>)• 

We therefore have an expression for the energy given by 


E = 




[ dx - ( - 

J [2 \dxj 

/ dx2 [/(</.)=/ d(j>—2U(<t>) 

J — 00 J — v 

f V tfo [2 U(<j>)]i . 

J —V 


Plugging in [2 2 = (\/2) 1 / 2 (v 2 — (f> 2 ), we obtain the energy 3 

1 4m 3 


E = 


V2 3A 


(29.3) 


(29.4) 


This is finite and the kink is therefore allowed to exist! 
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The kink has a finite energy and, by construction, is a time-independent 
field configuration. It is also of a finite size l (see Fig. 29.8), the extent 
of which is determined by a balance of kinetic and potential energy. The 
kinetic energy’s contribution f dx^(d\ (f>) 2 ss l x ( v/l ) 2 tries to smear the 
kink out, but the potential contribution f da; U ss An 4 / tries to limit the 
region over which the field changes. The minimum energy is then to be 
found when the size of the kink is l ss (An 2 ) - ! = 1/m, where m is the 
mass of the conventional particles of the theory. 


E(x) 



|_| Fig. 29.8 The energy density of a kink 

is localized in an interval l. 

Example 29.3 

The domain wall in a ferromagnet has much in common with the kink described 
here. For the ferromagnet the vacua correspond to all spins pointing up ( <j> = v) or 
down (</> = —v). The kink corresponds a region where the fields change from up to 
down over a finite distance. This separates two magnetic domains. A domain is a 
region where the symmetry is broken in a particular way. It is, for example, part of 
the Universe where the vacuum corresponds to all spins pointing up. Domain walls 
therefore separate regions with different vacua. 

Domain walls are real and their existence may be detected in magnets through 
a variety of means. In a magnet, domains allow a magnetic system of finite spatial 
extent to save energy by reducing the field density in free space. However, the domain 
wall might be expected to exist on the more general grounds discussed here. 

Also interesting is an idea of Nambu’s that in reaching its final state the Universe 
has broken various symmetries. This, he suggests, might imply that domains with 
different vacua may exists across the Universe. 


We have a field configuration with finite energy above the ground state 
which is of finite spatial size. The theory is translationally invariant: 
the kink’s centre can be anywhere. The theory is Lorentz invariant: the 
kink can be boosted to an arbitrary velocity. The kink is, therefore, 
very much like a particle. In addition, the kink is stable: any attempt 
to remove the kink involves lifting a (semi)infinite length of field from 
one of the potential minima to another. This costs an infinite amount 
of energy and is therefore impossible. A more mathematical statement 
of this is that the kink is not deformable. We cannot hold the ends 
of the kink tightly and then remove the part that crosses the axis in 
between. The one way of annihilating a kink would be to have an anti¬ 
kink: an object whose field crosses the axis in the opposite direction so 
that <j)(— oo) = v and </>(oo) = —v, as shown in Fig. 29.9. The kink and 
antikink pair is deformable. If we hold the ends of the field tightly we 
see that we can continuously change the part that crosses the axis to 
remove the parts that cross the axis. We end with a field lying in the 
vacuum at <p = —v. 

This stability of kinks is encoded in a description of its charge. A 
charge, remember, is designed to be a conserved quantity. A single kink 
should carry a topological kink-charge of Q t = 1; an antikink a kink- 
charge of Q t = — 1. In order to come up with a sensible recipe for 
finding the kink-charge, we define the kink-current 4 as 

(29.5) 



Fig. 29.9 A kink and antikink. 


^ Note that this is different to the 
Noether currents that we have consid¬ 
ered so far. Its existence owes nothing 
to a symmetry of the Lagrangian. 
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c 

Remember that our convention is not 
to treat this as a tensor and so = 


^To summarize the argument so far, the 
theory 

c= - 2 {d^) 2 -U(4>), 

with 

u{4>) = 

allows the existence of kinks. These are 
topological objects which behave rather 
like particles. The field at x = oo and 
x = — oo live in different vacua. 



Fig. 29.10 Some vortices with winding 
number n equal to (a) 1; (b) 1; (c) — 1; 
(d) 2. 

The phase ip can be chosen to globally 
alter the directions at infinity. So, for 
example, Fig. 29.10(a) corresponds to 
ip = 0, while Fig. 29.10(b) corresponds 
to cp = 7r/2. This can also be absorbed 
into the complex amplitude K. 


where e^ v is the antisymmetry symbol, whose signs are determined by 
knowing that e 01 = 1. This leads to an expression for the charge of a 
kink of 


Q T 


1 


/ d.r r = — 

J—oo J — oo 

^[<Hoo) - </>(-oo)] = 1 


dct> 
da: — 
ox 


(29.6) 


whereas an antikink has Qt = — 1- Notice that the phions of the theory 
do not have a kink-charge since Q t = <f>( oo) — </>(—oo) = 0. We call the 
kink-charge a topological charge, which explains the subscript. Its 
existence is independent of the geometry of spacetime. That geometry 
is encoded in g ^, which is an instruction book telling us what watches 
and measuring rods measure in spacetime. In contrast the topological 
charge has its indices summed with the antisymmetric symbol 5 e**". The 
dependence on e ltv rather than g^ u turns out to be a general feature of 
topological objects and is examined in more detail in the next chapter. 6 


29.3 Vortices 

Now we move to (2 + l)-dimensional spacetime and examine the same 
Lagrangian. Our potential now resembles a Mexican hat, whose minima 
describe a circle in the (j>i-(j )2 plane. It will be convenient to work in polar 
(internal) coordinates (r, 9) and write (j>{x) = (j>\{x)+i(j) 2 (x) = r{x)e l6 ^ x \ 
In the case of the kink we found a topological field whose two ends 
lived in different vacua at spatial \x\ —» oo. By analogy we ask what a 
continuous field would look like whose elements live in different vacua 
at spatial infinity. Some examples of such configurations are shown in 
Fig. 29.10. These objects are known as vortices. They are described by 
a field whose form at infinity is written 

(|aj| —> oo), (29.7) 

where 9(x) = tan _1 (a; 2 /a: 1 ) and <p is a constant, arbitrary phase. 7 Notice 
how this equation relates the direction in which the field </> points in the 
internal complex plane to the angle 8 that tells us where we are in 
coordinate space. Here n is known as the winding number; it tells us 
how many times the field winds around as we travel around the circle 
at spatial infinity. 


Example 29.4 

We may evaluate the energy of a vortex. For a static field configuration we have a 
Hamiltonian 

H = i V0 f • V<£ + U((j)) , (29.8) 

and we take = (|if| 2 — <$<!>)* ', so that U((j)) = 0 on the boundary at infinity. 

The gradient of the vortex in cylindrical (spatial) coordinates is 

V</> = -(i nKe inB )e e . 


(29.9) 
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The core of the vortex looks frighteningly singular, but we will assume it is ultimately 
well behaved and call the core energy E CO re(o,), where a is the core size. We evaluate 
the energy of the rest of the configuration at distances larger than r = a and find 

y ^*oo P oo 

drddr'H = E COIe + nn 2 \K\ 2 dr-. (29.10) 

a J a r 

This is a logarithmically divergent energy. We conclude that a single vortex is not a 
stable object. 


The single vortex is unstable. This is understandable if we look at the 
form of the field at large distances (Figs. 29.11 and 29.12). It is still 
swirly as r —» oo. Actually this is predicted by a general theorem due to 
G.H. Derrick, which says that time-independent topological objects are 
impossible for our theory in more than one spatial dimension. 

To stabilize the vortex we must tamper with the Lagrangian. The way 
to do this is to add a gauge field. Recall that adding a gauge field (known 
as ‘gauging the theory’) is done by introducing a covariant derivative as 
follows: 

Dfj.cj) = d,j,(j) + iqA^cj). (29.11) 

In order to cancel the divergence of the vortex we’re going to choose our 
gauge field in such a way that it cancels the divergent part of the vortex 
energy, which resulted from the derivative of the field configuration. 
We therefore want the covariant derivative D^(j> to vanish at infinity. 
A gauge field that does this is one whose limit as r —> oo tends to 
A(r,0) = ±V(n0). 


y 



Fig. 29.11 Fields at large distances 


for an n 

= 1 vortex. 
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Example 29.5 

The components of the proposed gauge field at infinity are 8 
A r —► 0, Ao —> — — as r —> oo. 

’ u qr 

As r —> oo the components of the covariant derivative become 


Fig. 29.12 The vortex field configura¬ 
tion for an n = 1 vortex. 


(29.12) 


^ Recall that A — A 1 = —Ai in follow¬ 
ing the sign changes here. 


D r <j) = 0, D e <j) — ~ ^ + i qAg(j) = 0 as r —» oo. (29.13) 

This shows that the covariant derivative (which represents the ‘kinetic energy’ of 
the field) vanishes at infinity. Since it was this term that diverged for the ungauged 
theory, we see that A has saved the day. 


The gauge field should also be expected to make a contribution to the 
Lagrangian of the form — jF^F^. However, this is not the case here. 
Usually we have a gauge field which can be changed by a gauge trans¬ 
formation which adds a contribution d^x- I n our case we notice that at 
infinity A = Vy, where x = \ n ®' That is, our gauge field is entirely 
formed from a gauge transformation and is therefore another example 
of the pure gauge which we first saw in Chapter 26. This means that 
F^u = 0 and the gauge field cannot result in a nonzero —\F 2 contribu¬ 
tion to the Lagrangian. This is also a relief, since such a contribution 
might also threaten to blow up, ruining the stability of the vortex. 
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What then is the contribution of the gauge field to the physics? The 
answer is found by carrying out the integral j> A ■ dl around a circle at 
infinity. From Stokes’ theorem we know what this means physically: the 
integral is j> A ■ dl = J B ■ dS = <f>, that is the magnetic flux through 
the circle. Carrying out this integral we obtain 

$ = £A-dl = A e rd6 = J d6™ = 2 ^. (29.14) 

We see that vortices carry quantized magnetic flux! 

The next step in this line of argument to examine a topological object 
in (3+l)-dimensional spacetime. Remarkably, this object turns out to 
be a form of a magnetic monopole and is examined in Chapter 49. In the 
next chapter we turn our attention to another aspect of topology and 
describe a field theory that can be legitimately described as topological. 


Chapter summary 

• Topology relates the structure of objects that are preserved un¬ 
der continuous deformations involving stretching but no cutting or 
gluing. 

• Topological objects include kinks and vortices and can be charac¬ 
terized by a topological charge. 


Exercises 


(29.1) Show that the antikink has kink-charge Q t = — 1 
and that two kinks and three antikinks have a total 
charge of Qt = — 1. 

(29.2) This is the first time we have used the antisymmet¬ 
ric symbol £in anger. We regard the up and down 
versions as identical since we are not using it as a 
tensor. Let’s examine its properties. Take £ 01 = 1 
and e 012 = 1 and show the following: 

(a) e ij e ij = 2. 

(b) £ ij £ in = 5 jn . 

(c) £ ij £ mn = SjnSim ~ 5 in S jm . 

(d) £«*£*"" = SjmSku - 8j„5km- 

(29.3) A simple model of a domain wall in a magnet 
says that the interaction energy of two neighbour¬ 
ing (classical) spins of magnitude S is given by 
E = — JS 2 cos 8 where 6 is the relative angle be¬ 
tween the spins and J is a constant. 

(a) Show that the energy of a domain wall sepa¬ 


rating a domain of spins up and a domain of spins 
down costs energy 

u = \ js2 i ’ (29 - 15) 

where N is the number of spins in the wall. 

(b) Show that this picture will not lead to a stable 
wall. 

(c) In reality spin-orbit effects lead to an energy 
cost for not having spins pointing along certain spa¬ 
tial directions. In this case these lead to a contri¬ 
bution to the energy of Show that the wall is 
now stable and find its size. 

(29.4) (a) Draw vortices with winding numbers n = —2 
and n = 3. 

(b) Draw the fields resulting from vortices with 
n = 1 and n = — 1 sharing the same region of space. 
This situation is discussed in Chapter 34. 
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Imagine the perplexity of a man outside time and space, who 
has lost his watch, his measuring rod and his tuning fork. 

Alfred Jarry (1873-1907), Exploits and Opinions of Doctor 
Faustroll, Pataphysician 

So far we have met topological objects that can exist in scalar field the¬ 
ories with a broken symmetry. In this chapter we will take the study 
of topology further and cook up an inherently topological field the¬ 
ory. This theory will apply to the special case of (2+l)-dimensional 
spacetime. We will see that physics in the flatland of (2+1) dimensions 
has some remarkable properties whose origin may be traced back to 
topological considerations. 

We start our discussion by examining the sorts of particle statistics 
that apply in two spatial dimensions. We will see that flatland supports 
anyons: particles that are neither bosons nor fermions. We will then 
go on to formulate a field theory that not only captures anyon statistics, 
but also describes some curious new electrodynamics. 


30.1 Fractional statistics a la 
Wilczek: the strange case of 
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30.1 Fractional statistics a la Wilczek: the 
strange case of anyons 

We are quite used to the idea of bosons and fermions as entities distin¬ 
guished by their behaviour upon exchange of identical particles. Specif¬ 
ically, changing the labels on two identical particles results in a change 
of the wave function according to the rule 


tp(x 1 ,x 2 ) = ±ip(x 2 ,x 1 ), 


(30.1) 


where the + sign applies to bosons and the — sign to fermions. 

For the case of two-dimensional space, the exchange of particles needs 
more careful attention. In fact, we need to think more precisely about 
what the exchange of particles actually involves. We do not simply 
make the two particles vanish and then reappear in different positions. 
Rather we imagine the process in which particles are moved around 
each other in real space. Let’s identify two moving-arouncl processes. 
We start with two identical particles at positions aq and x 2 and identify 
two fundamentally distinct ways of exchanging them. The result of 
processes of type A is to move aq —> aq and x 2 —> x 2 . Some examples 
are shown in Fig. 30.1(a) and (b). In this sort of process the particles 




268 Topological field theory 


(a) (b) 



Fig. 30.1 Examples of ways of ex¬ 
changing particles, (a) and (b) are type 
A processes, where the particles end up 
at the same positions. In case (b) one 
particle loops around the other once in 
the exchanging process, (c) and (d) are 
type B processes where the particles ex¬ 
change positions. In case (d) one parti¬ 
cle loops once around the other during 
the exchange. 


Frank Wilczek (1951- ) was a co¬ 
recipient of the Nobel Prize in Physics 
in 2004 for their discovery of asymp¬ 
totic freedom. His popular book The 
Lightness of Being (2008) is a very read¬ 
able account of modern ideas in particle 
physics. 


end up where they were originally, although they may move around each 
other. The result of type B processes is to move x\ —> X 2 and X 2 —► Xi 
[Fig. 30.1(c) and (d)]. Again particles may move around each other 
several times before settling at their final positions [as in Fig. 30.1(d)]. 
This is particle exchange, albeit in a more tangible form than the usual 
magic trick method. 

We ask what the relative quantum mechanical phase difference is be¬ 
tween processes of type A and type B. The key parameter to consider 
is the angle that one particle is moved around the other. This is where 
topology comes in: given a set of particle paths, we may smoothly dis¬ 
tort the paths of the particles, but we may not change the number of 
times particles wrap around each other without introducing singularities 
in the particle paths. Processes of type A involves rotating particle 2 
around particle 1 by angle 0 = 2np, where the winding number p takes 
an integer value (including zero). Processes of type B involves rotations 
of 0 = 7t(2 p + 1). Each value of p describes a topologically distinct 
process. 

We suppose that these topologically distinct processes make a mul¬ 
tiplicative contribution to the total wave function (or, in Held theory, 
to the path integral) of $(0) which we expect to be pure phase. If we 
carry out a sequential string of these processes then we require that the 
angles add, whilst the wave functions should multiply. That is, we need 
$(<£1 + 02 ) = $( 0 i)$( 0 2 ) which implies that $( 0 ) = e 1?? ^ where 77 is a 
parameter which, crucially, doesn’t need to be an integer. 

Now we compare our real life exchange to the old-fashioned definition 
in eqn 30.1. If we carry out the exchange (xi,x 2 ) —> (x 2 ,xi) [shown in 
Fig. 30.1(c)] then the formal definition of exchange embodied in eqn 30.1 
tells us that the wave function should be identical (for bosons) or pick 
up a minus sign (fermions). However, the realistic version of exchange 
merely tells us that 0 = 7 r, resulting in a phase factor of <f>(0) = e lr,7r . 
The two versions of exchange are only identical for the special cases that 
(a) we have 77 = even integer, when we recover the expected exchange 
behaviour for bosons or (b) we have 77 = odd integer, when we recover 
fermion exchange. However, this analysis shows that there are many 
more possible values of rj in two dimensions. It doesn’t have to be 
an integer! We are therefore not tied simply to bosons and fermions, 
the freedom to choose 77 means we can have any exchange statistics. 
Particles with such statistics were dubbed anyons by Frank Wilczek, 
who has made important contributions to elucidating the study of these 
particles and their ‘fractional statistics’. 

Notice that having only two spatial dimensions to play with was vital 
to the argument. In three spatial dimensions all of the type A processes 
are topologically identical since they are all deformable into paths where 
the particles don’t move. Similarly all of the type B processes are topo¬ 
logically identical and may be reduced to a simple exchange of particles. 
This reduction occurs because the extra dimension allows us to move 
the paths past each other in the third dimension, shrinking all loops to 


zero. 
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30.2 Chern—Simons theory 

We now turn to the problem of writing down a Lagrangian that can be 
described as topological. What do we mean by this? Our usual La- 
grangians have relied on the use of the metric tensor g which is an 
object that tells us how to measure displacements in space and time. 
It provides a set of instructions for taking the scalar product of vectors 
through a contraction of indices. In contrast, in a topological theory 
the contraction of indices is carried out through the use [in (2+1) di¬ 
mensions] of the antisymmetric symbol . Topological theories are 
therefore blind to the details of the watches and measuring rods that 
are contained in g^ u . Their content is dependent purely on the topol¬ 
ogy of the manifold 1 in which we’re working. The first consequence of 
this feature is a shock when we try to derive the Hamiltonian from a 
topological Lagrangian: the Hamiltonian vanishes! 


Example 30.1 

To see this we note that a general method of finding the energy-momentum tensor 
is to vary the action with respect to the components of the metric. For an action 
Stop derived by integrating a topological Lagrangian we have 




— 2 S Stop 

V~ det g 5g^ u 


(30.2) 


Since topological theories don’t include at all then we must have = 0. Since 
the Hamiltonian is the 00th component of this tensor then we conclude that H = 0. 

A consequence of this is that all of the states of a topological Hamiltonian have 
energy E = 0. So there is a significant degeneracy in the system. Unfortunately it 
is rather difficult to determine how many states we have since this depends on the 
topology of the manifold. 


We will now describe a topological term that may be part of a La¬ 
grangian. This is known as a Chern—Simons Lagrangian and applies 
to the case of (2+l)-dimensional spacetime. It is written 2 

C = —^K£ IJl/X a l jd, J ax, (30.3) 

where is a U( 1) gauge field and n is a constant. Chern-Simons terms 
are easy to spot since they have the form C = eada. We will see that 
Chern-Simons theory contrasts in many respects with the other U{ 1) 
gauge theory with which we’re familiar: electromagnetism. 


Example 30.2 

The Chern-Simons Lagrangian gives rise to a gauge invariant action. Actually the 
gauge invariance is slightly more subtle than the usual version seen in electromag¬ 
netism. To show this, we make the gauge transformation + d^Xi an d we 

obtain 

e^ uX a^d u a\ —> e tJ,uX a IJ d u a\ + (30.4) 


^A manifold is a topological space 
that locally resembles Euclidean space. 


Chern-Simons theories were first intro¬ 
duced into field theory by Edward Wit¬ 
ten (1951- ) based on work in differen¬ 
tial geometry. They are named after 
Shiing-Shen Chern (1911-2004), math¬ 
ematician, and James Harris Simons 
(1938- ), mathematician, hedge fund 
manager and philanthropist. 

2 

Notice that the Chern-Simons theory 
will work in (2+l)-dimensional space- 
time but not in (3+1) dimensions. Try 
writing it in (3+1) dimensions and 
you’ll see that the indices won’t match 
up. 



Here we have used the fact that e^^d^duX = 0? which follows from the antisymmetry 
of The resulting change in the action (that is, the second term in the above 

equation) may be written as a total derivative 

68 = j cT^^^( xcUa). (30.5) 

This vanishes under the assumption that we may drop boundary terms. We conclude 
that the Chern-Simons Lagrangian is gauge invariant, as long as we discard the 
boundary terms. 


The next thing to note is that if we plug this term into the Euler- 
Lagrange equations we obtain an equation of motion for the fields 
as 

K£^ uX d u a\ = 0, (30.6) 

or defining a field strength tensor f llL/ = d^a^-d^a^ we have an equation 
of motion \n£^ x f v \ = 0 or 

fHu = 0. (30.7) 

This looks very unexciting! Unlike electromagnetism, for which, in the 
absence of sources, we have an equation of motion d^F^ 11 = 0 which 
supports plane waves, the Chern-Simons theory seems to have no dy¬ 
namics of its own. Nevertheless, the Chern-Simons theory will turn out 
to be interesting, but we just need to couple it to another field. 

Therefore, we now couple our Chern-Simons field a^ to a source 
which is the conserved current of some other field and write 

C = - ]^K£^ x a^d v a\ + a p (30.8) 

and this leads to an equation of motion 

= K£^ x d„a x = \K£^ vX f vX . (30.9) 

Clearly this expression for current isn’t a consequence of Noether’s the¬ 

orem; rather, it occurs as a result of our coupling a source to the Chern- 
Simons gauge field by hand. The Chern-Simons term therefore provides 
a constraint on the current. 

In order to understand the properties of the current we now point out 
an odd, but undeniably relevant feature of (2+l)-dimensional spacetime. 
This feature stems from the fact that, in two spatial dimensions, the 
cross product results in a pseudoscalar, rather than a pseudovector. The 
result is that the magnetic field in (2+l)-dimensional spacetime is a 
pseudoscalar: B = £ v ^diAF 


Example 30.3 

To see this we note that in three dimensions we have the definition of the cross 
product in component form V 1 = e l3k A 3 B k . In two dimensions we lose an index 
from our antisymmetric symbol and we have S = s 13 A 1 B 3 . We run out of indices, 
so the result of the cross product is a pseudoscalar as claimed. 
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Now turning to the components of the current J M = (p,J) predicted 
from eqn 30.9, we find 

p = K(d\a 2 — d 2 a\), J 1 = k(— d 0 a 2 + d 2 a 0 ), J 2 = n(d 0 ai - dia 0 ). 

(30.10) 

Although the field a M does not describe electromagnetism, we may use 
the language of electromagnetism and call b = d 2 a\ — d\a 2 the a M mag¬ 
netic field b and call (9o<+ — did o the ith component of the a^ electric 
field e 1 . We therefore have 

p = -nb, J 1 = -KE ij e j , (30.11) 

where e 12 = 1. The second of these equations tells us that an electric 
field in the y-direction gives a source current in the ^-direction. The 
meaning of the first equation is made clear if we integrate over the two- 
dimensional space to find that the charge Q of the source field is related 
to the flux of the a^ magnetic field by 

Q = —k j d 2 xb, 
or 

/ Charge of \ / Flux of \ 

source field J \ field J 

Thus we conclude the following: 

Chern-Simons theory ties a M -flux to the charge of the source field. 


(30.12) 

(30.13) 


30.3 Fractional statistics from 
Chern—Simons theory 


The fact that external source charge couples to a M -flux has an influence 
on the statistics we find on the exchange of particles. Recall that in 
(2+1) dimensions moving one particle completely around another gives 
rise to a phase contribution e 2mri , where ?y is an odd integer for fermions 
and an even integer for bosons. 

Now recall the Aharonov-Bohm effect from Chapter 29. Moving one 
charged particle completely around a source of flux gives rise to a phase 
contribution e 19 *, where q is the particle charge and $ is the flux. Using 
our Chern-Simons result that = —k$>, we obtain a prediction of the 
Aharonov-Bohm phase of e -1 ^. However, particle exchange (which 
tells us about statistics) is slightly different to this in that it involves 
moving one particle around the other through an angle 7r. Interpreting 
the above Aharonov-Bohm process as two lots of particle exchange (and 
therefore contributing a phase 27777 ) we obtain a result for the statistics 
of our particles, 3 namely that 

a 2 

2nr] = - —, (30.14) 

K 


3 

In fact, there’s another tricky factor of 
two here. One is tempted to say that 
since all particles carry both charge and 
flux then moving one charge around a 
flux also moves a flux around a charge, 
doubling the effect compared to what 
we have written. A more careful treat¬ 
ment confirms our naive picture of a 
single charge moving around a single 
flux and gives eqn 30.14. See Wen, 
Chapter 7 for further discussion. 
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and hence ?y = — ^7 is not necessarily an integer. This implies that the 
particles exhibit fractional statistics. 

This chapter has explained how to construct a topological Lagrangian. 
It is based on gauge fields; it supports excitations with fractional statis¬ 
tics; its charged excitations carry flux. In Chapter 45 we’ll examine a 
phase of matter which is described by a Chern-Simons theory: this is 
the fractional quantum Hall fluid. 


Chapter summary 

• Topological field theories such as Chern-Simons theory are built 
from products of gauge theories summed together using the anti¬ 
symmetric symbol in place of a metric tensor. These theories 
provide constraints on other fields to which they are coupled. 

• Chern-Simons theory attaches flux to the charge of the source field. 

• Chern-Simons theories may produce fractional excitations. 


Exercises 


(30.1) (i) Verify that e^ vX d^d v x = 0. 

(ii) Verify eqn 30.4. 

(30.2) Show that the Chern-Simons term won’t work in 
(3+l)-dimensional spacetime. Suggest a form for 
(4+l)-dimensional spacetime. 

(30.3) Derive the equations of motion for the Chern- 
Simons fields coupled to a source in eqn 30.9 using 
the Euler-Lagrange equations. 

(30.4) The propagator for Chern-Simons theory may be 
found in a manner very similar to that used in 
electromagnetism in Chapter 39. Starting with 
Chern-Simons theory we add a gauge fixing term 
— ^(d M a M ) 2 to obtain 

C = -^e^a^ax - ^(d^oC) 2 + J^a^. (30.15) 

(a) Show that the equations of motion for the field 
are given by 


which provides the propagator for the theory via 

i(M-V. 

(c) Show that the Chern-Simons term therefore 
leads to a term in the Lagrangian that may be writ¬ 
ten 

£ = jA - (30.18) 

This is known as the Hopf term. 

(30.5) Consider Chern-Simons electromagnetism de¬ 
scribed by a Lagrangian 

C = - ^ x a,d v a x - ^(d^) 2 , 

(30.19) 

where = d M a^ — d„<v 

(a) Find the equations of motion for this theory. 

(b) Show that the inverse of the equations of motion 
is given by 


r 


s^ x d„ - -*d M d A 


a x = M^ax- (30.16) 


(b) Show that the inverse of the equations of motion 
is given, in momentum space, by the matrix 

(M-^xa = ^ ex ^f (30.17) 


up* 


(AT 


p 2 (p 2 — ft 2 ) 


+c- 


p 

(30.20) 

What is the significance of the pole in the propaga¬ 
tor at p 2 = ft 2 ? 


(30.17) 




Part VIII 


Renormalization: taming 
the infinite 


This part is structured as follows: 

• Interactions in quantum field theories change the properties of the 
particles, producing dressed particles, also known as quasiparticles. 
These are examined in Chapter 31 which also gives an outline of 
Landau’s Fermi liquid picture. 

• Quantum field theory has some uncomfortable divergences and 
Chapter 32 shows how these can be tamed using renormalization 
using additional counterterms. 

• These ideas are applied to deriving propagators and making Feyn¬ 
man diagrams in Chapter 33. This allows us to write down a 
Green’s function called the self-energy which describes the change 
in the mass of a particle due to the interactions, and also a vertex 
function which describes screening. 

• The renormalization group is introduced in Chapter 34. Examples 
treated include asymptotic freedom, Anderson localization, and 
the Kosterlitz-Thouless transition. 

• Chapter 35 treats the ferromagnetic transition as a further exam¬ 
ple of the renormalization group procedure and shows how critical 
exponents can be extracted. 
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This chapter focuses on how the behaviour of particles is affected by 
interactions between them. We will see that the passage from a non¬ 
interacting theory to an interacting theory involves not only a propensity 
for particles to scatter from each other but, more dramatically, changes 
in the properties of particles themselves, such as their mass and charge. 
We will say that the values of these properties are renormalized. Fur¬ 
thermore, we will find that interactions even change what we mean by a 
‘particle’. The process of renormalization may be imagined as a particle 
dressing itself in interactions. 

A rather familiar example of interactions changing the apparent prop¬ 
erties of particles is the screening of a positive charge embedded in a 
metal. The positive charge builds up a cloud of electron density around 
it. Distant electrons do not feel its full charge because the nearby elec¬ 
tron density screens the charge, reducing its apparent value. A second 
example is the effective mass of an electron in a crystal. The fact that a 
crystal contains immobile ion cores which interact electrostatically with 
the electron means that the apparent mass of the electron is not that of 
the bare electron in a vacuum, but instead an effective mass to*. 

We will discover that analogous processes occur in all interacting quan¬ 
tum field theories. Of course the difference between condensed matter 
physics and a ‘fundamental’ theory such as quantum electrodynamics is 
that, for the former, we may measure the mass and charge of electrons 
both inside the crystal and in vacuo. In contrast, we cannot remove elec¬ 
trons from the vacuum of quantum electrodynamics to see the difference 
in their properties caused by interactions. However, the framework in 
which we understand the fate of interacting particles is the same in both 
cases and it is to this set of ideas which we now turn. 


31.1 Recap: interacting and 
non-interacting theories 

1 The simple scalar field theory with La- 

grangian C = \ (<9 M </>) 2 - is an Our simplest field theories describe particles that don’t interact. We call 

example of a free field theory. these free or non-interacting theories. 1 
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Example 31.1 

Here’s a recap of the properties of a non-interacting theory. For a theory described 
by a free Hamiltonian Ho, we define freely evolving field operators and a vacuum 
ground state. These may be defined through 

H o |0> = 0, p\0) = 0, (0|£(*)|0> = 0. (31.1) 

Particles are excitations of the vacuum. Creation and annihilation operators cre¬ 
ate single-particle modes, one at a time, labelled by their momenta. An example is 
dp|0) = |p). The amplitude for a field operator to create a (relativistically normal¬ 
ized) single particle with momentum p is given by 

(31.2) 


Finally, to talk about particles we need a propagator. This describes a single particle, 
inserted into the system at y and removed at x, and which, for a non-interacting 
theory, will not affect or be affected by any other particles in the system. The 
Feynman propagator describing the evolution of a free, single particle is given by 


Go(p) 


p 2 — m 2 + ie 


(31.3) 


If we didn’t know the mass of the particle before (from its dispersion relation Ep — 
p 2 = m 2 ) then we could read it off the propagator as the pole of the Green’s function. 


So much for the world of non-interacting particles. What happens when 
particles interact? Consider a Hamiltonian written as a sum of non¬ 
interacting and interacting parts 

H = H 0 + XH'. (31.4) 

In order to examine the interacting system, we will imagine what hap¬ 
pens to a free theory described by Hq when we slowly turn on an inter¬ 
action H'. To do this we multiply H' by a function A (T) which starts 
vanishingly small and slowly approaches unity as we turn up some pa¬ 
rameter T, as shown in Fig. 31.1. As A grows, our system gradually 
becomes a bubbling cauldron of interactions: the new eigenstates of the 
full Hamiltonian H (called | p x )) are different to those of Ho (called |p)). 
The interacting ground state Q) is different from the non-interacting 
ground state |0) and the dispersion of the particles is also altered. If 
we put a test particle into this system it will interact with particles and 
antiparticles, pulling them out of the vacuum. These particles and an¬ 
tiparticles may also interact with still more particles and antiparticles, 
doing all manner of complicated things. Our original particle may be 
lost in all of this havoc and it might be doubted that we can even iden¬ 
tify single particles in this system any more. We may have to abandon 
our quantum field theory which, after all, is based around the notion of 
creating and annihilating single particles. We are led to ask: are there 
such things as single-particle excitations in an interacting system? The 
point of this chapter is to show that there are! They are called dressed 
particles or quasiparticles. 


A 



Fig. 31.1 Adiabatic turning on of H'. 
The full Hamiltonian is given by H = 
Ho + \{T)H'. This thought experiment 
allows us to examine what happens to 
particles in an interacting theory. 



276 Renormalization, quasiparticles and the Fermi surface 


Quasiparticles are the excitations in interacting systems. They 
resemble free particles, but have different masses and interact 
differently with their environment. The process of free particles 
turning into dressed particles is called renormalization. 


31.2 Quasiparticles 

We imagine that as the interaction is turned on the particle begins its 
transformation from a single non-interacting particle into a quasiparticle. 
We say that the particle becomes dressed by interactions and becomes 
a quasiparticle. The process of renormalization is therefore embodied in 
the equation 

(Quasiparticle) = (real particle) + (interactions), (31.5) 

where the word quasiparticle may be substituted by ‘dressed particle’ or 
‘renormalized particle’, depending on the context. 

Let’s examine what happens to a free particle as the interaction is 
turned on. How do we know if it’s possible for an interacting quasi¬ 
particle to exist with the same momentum once we’ve turned on the 
interaction? To answer this, we act on the interacting vacuum with the 
particle creation operator <${x) oc f p a p e lp ' x + a p e lp ' x that we had for 
the non-interacting system. As we turn on the interaction the ground 
state is no longer |0), it’s now \ fl) (where H\fl) =0). The excited states 
are called | p A ), where A tells us that we have interactions. The key to 
understanding dressed particles is that the state aj,|f2) created by our 
field operator won’t generally contain one single-particle excitation. In 
fact, our operator aj, is a bull in a china shop: there’s nothing to stop 
the free field operator producing several excitations all at once! 

To make a single-particle excitation with momentum p in the free 
theory we acted on the free ground state with an operator aj, and said 
|p) = aj,|0). To make a single particle with momentum p in the inter¬ 
acting theory we might act on the interacting ground state Q) with a 
different operator |p A ) = gj,|f2). This all makes sense, in that we need 
to have the right tools for the job, that is, the right operators for the 
ground states, and we make the expected number of particles (i.e. one). 
Unfortunately we don’t know what q j, is and so can’t use it on the sys¬ 
tem. All we have is the single-particle creation operators aj, for the 
non-interacting states. 

But what happens if we use the wrong operator on the interacting 
ground state? If we apply the operators for the non-interacting theory 
aj, to the interacting ground state |0) we make a superposition of states. 
One might be the state we intend to make |p A ) , but others will be 
multiparticle states whose momenta each add up to p. That is 


aj,|f2) = |p A )(p A |aj,|fl) + (multiparticle parts). 


(31.6) 
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For example, aj,|f2), might contain a multiparticle state 2 with two par¬ 
ticles and an antiparticle like this: <Zp 1 < 7 p 2 <Zp 1 +p 2 -p|^)- 

In fact, it can be more complicated still, since we may only approxi¬ 
mately create a state \p x ) (along with multiparticle parts) and in actual 
fact create an ultimately unstable, narrow wave packet of width T p re¬ 
sembling this state. In particle physics this is known as a resonance. 

In this case the energy of the particle may have an imaginary part T p . 

A complex energy E p + ir p will cause a particle to time evolve accord¬ 
ing to e lEpt e~ Tpt or, in words, the particle is unstable to decay and has 
a lifetime 3 (2r p ) _1 . After a time that’s long compared to (2r p ) _1 we 
can’t expect there to be a quasiparticle any more. 

With all of these particles flying about and possibly decaying how can 
we say whether we have a single particle? We call the amplitude for 
making the desired single-particle part the quasiparticle weight Z p , 
where Z p = (p A |d|,|f2), and where | p x ) is a relativistically normalized 

momentum state. We say that we have a quasiparticle if Z p 7 ? 0. In the 
interacting theory, we can state that the amplitude in position space for 
creating a single particle with a momentum p is 

(p x \ft(xm = Zle^e-^. (31.7) 

In fact, to meaningfully describe a particle as a quasiparticle we will 
require that E p > T p , that is, its energy must be greater than its decay 
rate. Comparing with the analogous result obtained for non-interacting 
scalar field theory we conclude that, when T p is small, we can describe 
particles in an interacting theory by multiplying 4 our single-particle wave 4 In the jargon, one says ‘renormalizing’ 
functions by the factor z\ . “ ' ' rather than‘multiplying’. 

It’s important to note that the energies of interacting single particles 
are different from those of free particles. In the (relativistic) scalar 

field theory we’re describing 5 the interactions change the mass of the 5 In which the free particle dispersion is 

single-particle excitation from m (often called the bare mass) to m\, given by E p = (p 2 + m 2 )i. 

giving an energy for the state |p A ) of (p 2 +to 2 ) 2 . Since m\ is the mass 

of a single particle measured in our experiments, it’s often called the 

physical mass and given the symbol m p. The following table shows 

quantites for both non-interacting and interacting theories. 

Non-interacting: | p) = dj,|0) (p\<j>(x)\0) = e lp ' x m 

Interacting: |p A ) = (p\\cj>(x)\£l) = Z p e lp ' x e~ Tpt mp 


31.3 The propagator for a dressed particle 

For serious calculations we need to identify a propagator for the inter¬ 
acting system via our definition 

G(x,y) = (n|T0H(a:)^(j/)|fi). 


o 

Note that, in a metal, these multipar¬ 
ticle parts may be thought of in terms 
of the emission of electron-hole pairs. 
See Chapter 43. 


3 

Where the factor of two comes from 
considering the square modulus of the 
particle wave function. 


(31.8) 
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^ This can be achieved simply by sliding 
a resolution of the identity between the 
field operators in eqn 31.8. This reso¬ 
lution will be made up from the exact 
eigenstates Ip^) of the full Hamiltonian 
H. For details of the calculation, see 
Peskin and Schroeder, Chapter 7. 


In most cases involving long-lived ele¬ 
mentary particles Z p is independent of 
the three-momentum p and is simply 
denoted Z. As we shall see, a depen¬ 
dence on p is more relevant for metals 
in solids. 


7 r n 

Recall that the free scalar field propa¬ 
gator A (x,y, M 2 ) can be written in mo¬ 
mentum space as 


A(p, M 2 ) 


1 

p 2 — M 2 + ie' 


(a) 

p(M 2 ) 



(b) , 

P(M 2 ) 


M 2 



Fig. 31.2 (a) The spectral function 

showing a single-particle peak, (b) The 
spectral function showing a quasiparti¬ 
cle peak with lifetime (2T P ) — b 


It can be shown, without the aid of perturbation theory, what the gen¬ 
eral form for the interacting propagator G should be. 6 The propagator, 
which is built out of free operators a p and a' p , will not only describe the 
propagation of single-particle states, but also describe the fate of the 
multiparticle states, and takes the form 


G(P) 


i Z p 

p 2 — mp + iT p 


multiparticle 

parts 


(31.9) 


where, as before, Z p = (p*|^(0)|f2) is the amplitude for creating a single¬ 
particle state \p\) out of the interacting vacuum and mp is the mass of 
the state |p A ). Notice the role of T p here: it is the inverse quasiparticle 
lifetime in the place of the usual infinitesimal factor e. 


Example 31.2 


One way to write the full propagator that makes contact with experiment uses the 
spectral density function p(M 2 ). For example, for scalar field theory we rewrite 
the propagator in terms of the free propagator 7 A(x,y, M 2 ) which is now made a 
function of particle mass as well as the spacetime points x and y : 

roo r i at2 

G(x,y)= ——p{M 2 )A(x,y,M 2 ). (31.10) 

Jo (2tt) 

Suppose we have a spectral density function 

P (M 2 ) = (2 rr)S(M 2 - m 2 )Z + ( ) . (31.11) 


This is shown in Fig. 31.2(a). It is made up of a single-particle peak at mass M = mp: 
the physical mass of a single particle in the interacting theory. It also contains a 
multiparticle continuum which starts around 4m.p, which is the threshold for making 
two real particles in the centre of mass frame. This sort of spectral function is the 
one usually seen in particle physics problems where the creation of more than one 
particle is due to pair production. Plugging in this spectral density we obtain 


G(p) = 


i Z 


p 2 — rrip 


+ ie 


L 


d M 


4m£ 


2t r P(M \z-M 2 + ie' 


(31.12) 


Notice that the lifetime of the particle is e~ 1 , which is effectively infinite since the 
particle pole is infinitesimally close to the real axis. We expect these particles to be 
long-lived and stable. 

Now suppose we have the spectral density in Fig. 31.2(b), which has a quasiparticle 
peak with width 2T p . This leads to a propagator 


G{p) 


i Zp 

p 2 - m 2 + iT p 


multiparticle 

parts 


(31.13) 


where the quasiparticle now has a finite lifetime (2r p ) _1 (and we allow the possibility 
that Z is a function of p) . We see that our requirement that E p > F p for a meaningful 
quasiparticle tells us that the width of the peak in the spectral weight should be 
narrow compared to the distance of the peak from the origin. A spectral weight 
with a broad quasiparticle peak is often seen in condensed matter physics where 
there is usually a large number of very closely spaced energy levels. As a result 
the quasiparticle is a wave packet spread over (potentially very many) levels which 
eventually decays away. 

In addition to the quasiparticle having a finite lifetime it can also be shown that 
the multiparticle part usually undergoes some degree of destructive interference and 
decays after a time that is fairly small compared to Tp 1 , leaving the lone quasiparticle 
amongst the ruins! 
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Once we obtain the full propagator G{p) we can immediately read off 
the mass energy of the excited states mp from the position of the pole 
in the function, the residue at that pole gives us i Z pi the lifetime is 
determined from T p and, of course, we can use G(p) to calculate other 
quantities too, such as scattering amplitudes. 


31.4 Elementary quasiparticles in a metal 

The concept of dressed particles is especially important in condensed 
matter physics. Although, in reality, the condensed matter physicist 
usually deals with a system made up from a large number of very strongly 
interacting, real particles (such as electrons in a metal), renormalization 
involves changing our point of view and describing the system in terms 
of a small number of weakly interacting or non-interacting, fictitious 
particle-like excitations. Despite the fact that the ground state of a 
system, such as a crystal for example, isn’t an empty box 8 we regard 
it as a vacuum, in that it contains no excitations. A weakly excited 
state of the whole system (that is, one at low temperature) can then be 
described in terms of a small number of these excitations. These are 
often also called the elementary excitations of the system. 9 

A distinction can be made between two types of elementary excitation. 
The first are called collective excitations and correspond to particle¬ 
like excitations that occur through motion of all of the constituent parts 
of the underlying system. An example is the chain of coupled masses we 
considered in Chapter 2. There we described the excited states of the 
system, which involved the motion of all of the masses, as particle-like 
entities called phonons. 10 

The second sort of elementary excitation is the quasiparticle, a single¬ 
particle excitation from the non-interacting system dressed in interac¬ 
tions. An example is the single-particle excitation of an electron gas. In 
a non-interacting electron gas [Fig. 31.3(a)] we have single-particle exci¬ 
tations above the ground state called electrons and holes 11 [Fig. 31.3(b)]. 
As we turn on the interactions these excited electrons and holes become 
quasielectrons and quasiholes [Fig. 31.3(c)] . 


Example 31.3 

Let’s look at the case of electron quasiparticles in more detail. We start with the 
non-interacting version of a metal: the Fermi gas at T = 0. This comprises a box 
of N electrons, stacked up in momentum eigenstates | p) up to the Fermi level |j>p). 
(For simplicity we regard the electrons as spinless and, in that case, the momentum 
states may contain either 1 or 0 electrons.) The momentum distribution for the 
Fermi gas is shown in Fig. 31.4(a). This is the ground state of this non-interacting 
system, which we call |0). The particle dispersion is given by E^ = p 2 /2m e , where 
m e is the free electron mass. Note that near the Fermi level the dispersion may be 
written = up(|p| — pp), where 

v P = Vp£p| lp | =pF = ^-, (31.14) 


^Far from it, it usually comprises N 
10 23 particles of one sort or another. 


^ The approach of describing a system 
in terms of small numbers of excitations 
works in condensed matter physics be¬ 
cause we are most often interested in 
the behaviour of a solid at low temper¬ 
ature and this behaviour is dominated 
by the occupation of only those energy 
levels of the system very close to the 
ground state. 

^ Notice that if we turn off the inter¬ 
actions in the system (in this case the 
springs between the masses) then these 
excitations cease to exist. Another ex¬ 
ample of a collective excitation is the 
plasmon, which is a collective excita¬ 
tion of all of the electrons in a metal. 
We will examine these in Chapter 43. 


(a) .1. (b).|. (c).4 


Fig. 31.3 (a) The non-interacting elec¬ 
tron gas with a single electron above 
the Fermi level, (b) The particle exci¬ 
tations in the non-interacting electron 
gas. (c) The quasiparticles in the inter¬ 
acting electron gas [identical to (b) but 
with renormalized masses]. 

11 Remember that in the picture dis¬ 
cussed here, excitations only exist 
above the ground state. The ground 
state here consists simply of electron 
states filled up with electrons all the 
way to the Fermi level. In the non¬ 
interacting system an electron above 
the Fermi level is an excitation, which 
becomes a quasiparticle upon turning 
on the interactions. An electron be¬ 
low the Fermi sea is not an excita¬ 
tion and does not become a quasipar¬ 
ticle. A contrasting philosophy, where 
all electrons become quasiparticles, is 
Landau’s Fermi liquid theory, which is 
examined later in the chapter. 
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12 

The Fermi surface is spherical here so 
the Fermi velocity only depends on the 
magnitude of p F and we write |p F | = 
PF • 


( a ) np 



Fig. 31.4 (a) The momentum distri¬ 
bution of the Fermi gas. (b) The mo¬ 
mentum distribution of the interacting 
Fermi system. A discontinuity of size 
Z PF exists at the Fermi surface. 


( a ) Ep 



Fig. 31.5 (a) The dispersion of the 

free particles in a Fermi gas. (b) Near 
the Fermi momentum the dispersion is 
given by E p = (|p| — Pf)^F) where up = 
Pf/tyi* and m* is the effective mass of 
the particles. 


is known as the Fermi velocity 12 [see Fig. 31.5(b)]. 

We now turn on interactions between the electrons. The interacting ground state 
of the system is |Q), the eigenstates of the system are now called |p^), and the energies 
of these states are now E p . Near the Fermi level we again have E p = up(|p| — Pf)> but 
now u F = PF/ m *> where we call m* the effective mass. Following from our assertion 
that we may describe the weakly excited states of a system as quasiparticles, we 
expect that those states that lie close to the Fermi level pp will have nonzero values 

of Z p and may therefore be described as containing quasiparticles. As will be shown 
in Chapter 43 the free propagator for particles in a metal is given by 


Go(p) = 


i0(W — Pf) , i0(PF-|p|) 


(31.15) 


The interacting version becomes 

r, t s _ lZ p e(\p\ -p F ) iZ p d(pp - |p|) 


(31.16) 


q, n _ iZp9(\p\ - p F ) \Zp9(pp - |p|) / Multiparticle \ , gl . 

[P> E — E p + ir p E-Ep-iTp V P arts ) ' 

It will be useful in what follows to have the propagator in three-momentum space and 
the time domain. This is achieved by doing a Fourier transform (see Appendix B) 
with the result 


G(p, t) = 


p\ - p P )e -ifcpt e -ri>t - 0(-t)0(p F - |p|)e _iE P t e r P t 


Multiparticle 

parts 


(31.17) 


We may use the result of the last example to test whether we really have 
quasiparticles in a real-life metal by making a direct measurement of the 
quasiparticle weight Z p . To measure Z p we make a measurement of the 
momentum distribution n p of the weakly excited system (that is, when 
it contains some quasiparticles), given by n p = ( a) p a p ) which may be 
related to the propagator via (see Exercise 31.1.) 

n p = -lim t ^o-G(p,f), (31.18) 

giving us a prediction 

n P = 2 P 9( W -H)-( M '“^““ Cle ). (31.19) 

Assuming that the multiparticle contribution varies smoothly, we con¬ 
clude that the interacting metal should have a discontinuity in its mo¬ 
mentum distribution n p at its Fermi surface of size Z | P | =PF , as shown 
in Fig. 31.4(b). This prediction can be tested by performing Compton 
scattering experiments on metals. The results are found to be in good 
agreement with the theoretical predictions. 


Philip Anderson is arguably the most 
influential condensed matter physicist 
since Landau. Not only is he the dis¬ 
coverer/creator of a variety of theoret¬ 
ical descriptions of electronic and mag¬ 
netic phenomena (among many other 
things); his elucidation of the structure 
underlying the subject pervades current 
condensed matter physics. (Even the 
name ‘condensed matter physics’ is one 
of his contributions!) 


31.5 The Landau Fermi liquid 

In general, I believe that the attitude is at least justifiable 
that the instructive use of quasi-particles by the founders of 
solid state physics and by Landau’s school is hardly less rig¬ 
orous than the sophisticated many-body theory, and perhaps 
it is more foolproof, because it has less appearance of being 
rigorous. 

P. W. Anderson (1923- ), Concepts in Solids 
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A rather different way of understanding metals in terms of fictitious par¬ 
ticles was formulated by Lev Landau 13 and is known as Fermi liquid 
theory. It has been so useful in understanding metals that it is lit¬ 
tle exaggeration to claim that it has become the standard model of the 
metal. 14 Landau’s theory is phenomenological, but appealingly intuitive 
since it involves a description of a strongly interacting metal as being 
almost identical to the non-interacting Fermi gas! Moreover Fermi liq¬ 
uid theory allows the complete description of an interacting system in 
terms of a small number of parameters, while avoiding all of the com¬ 
plexities of perturbation theory. Central to the model is the fact that 
Landau describes an interacting metal in terms of quasiparticles; but 
these are different to the field theory quasiparticles described thus far 
in this chapter. We will call the former Landau quasiparticles to avoid 
confusion. 

In Landau’s picture we pay particular attention to the process of ‘turn¬ 
ing on’ the interaction between the non-interacting electrons of the Fermi 
gas. Landau assumed that if we very slowly turn on the interaction the 
system evolves continuously from Fermi gas to Fermi liquid, with each 
single-particle momentum eigenstate of the gas evolving into a single¬ 
particle momentum eigenstate of the liquid. This adiabatic turning on 
may be shown in ordinary quantum mechanics to lead to a very small 
amplitude for transitions out of the level, providing the density of final 
states is small, as it is for electrons within the Fermi sea as a result of 
the Pauli principle. 

Vital to Landau’s Fermi liquid concept is the notion that all of the 
the states | p) occupied by an electron in the gas with n p 0) = 1 be¬ 
come single-particle eigenstates | p) in the liquid occupied by a Landau 
quasiparticle. The Landau quasiparticles in the interacting ground state 
therefore also have momentum distribution . We say that there is a 
one-to-one correspondence between the free particles and Landau quasi¬ 
particles. On turning on the interaction the Landau quasiparticles take 
on an effective mass m* which parametrizes the change in energy of the 
eigenstates due to the effect of the field from the other quasiparticles. 

Despite the change in energy, there should be no ambiguity in the 
identity of an occupied eigenstate after turning on the interaction and 
therefore the energy levels of the states shouldn’t cross during the turn¬ 
ing on process. That is to say, upon turning on the interaction we must 
have the process shown in Fig. 31.6(a) rather than that in Fig. 31.6(b). 
We might ask whether this non-crossing of levels is a realistic propo¬ 
sition. Recall that when two eigenstates are related by a symmetry 
transformation which leaves the Hamiltonian invariant we obtain a de¬ 


generacy in energy. If we turn on a perturbing potential V which leads 
to a nonzero matrix element 5 between the two states we have a Hamil¬ 
tonian H = ( ^ ) which leads to a splitting in energy of 26. It’s 

as if the energy levels repel each other by virtue of a matrix element 6 
existing between them, which prevents them from ever crossing. The 


consequence of this for the metal is that, as we turn on the interaction, 


13 Lev Landau (1908—1968) was a So¬ 
viet physicist and one of the great¬ 
est scientists of the twentieth century 
who made contributions in many ar¬ 
eas, including phase transitions, mag¬ 
netism, superconductivity, superfluid¬ 
ity, plasma physics and neutrinos. He 
was also the model for the principal 
character, Viktor Shtrum, in Vassily 
Grossman’s novel Life and Fate. 

14 The concept of a Fermi liquid is a 
high successful phenomenological the¬ 
ory of interacting electrons. It is a bit 
of a departure from our main theme 
of field theories of interacting systems, 
but it is too useful to omit and more ad¬ 
vanced treatments than the one given 
here demonstrate that the machinery of 
diagrammatic perturbation theory can 
indeed be employed to derive important 
results in Fermi liquid theory. 



Fig. 31.6 Turning on interactions 
changes the states but the energy lev¬ 
els shouldn’t cross during the turning 
on process. Thus we can have (a) but 
not (b). 
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15 

This is exactly the case at a phase 
transition when, as we’ve seen, the sys¬ 
tem finds a new ground state with 
a lower symmetry than its previous 
ground state. The requirement that the 
levels don’t cross is therefore a require¬ 
ment that the evolution of the system 
doesn’t involve a change in thermody¬ 
namic phase. 

Energy 

A ' 



’g turning on 


es 

Fig. 31.7 Evolution of single-particle 
energy levels as interactions are turned 
on. Notice how they repel when they 
draw close to each other and never 
cross. 

Notice that this sets a limit on the 
speed at which we may turn on the 
interactions. Although we would like 
to turn them on infinitely slowly, any 
slower than T~} and our excited quasi¬ 
particle loses its momentum and, con¬ 
sequently, its identity. For a general 
quantum liquid to exist it must be pos¬ 
sible to select a turning on time that 
allows the ground state to evolve adia- 
batically, but also retains the low-lying 
quasiparticles. 

17 

Holes will be examined in Chap¬ 
ter 43. 


the levels repel each other when they get too close and never cross. This 
is shown in Fig. 31.7. However, this repulsion does not occur when the 
matrix element between two levels is zero. This is the case when the 
levels have different symmetries and then there is nothing stopping the 
levels from crossing. 15 Since in the absence of a phase transition the slow 
turning-on of interactions provides a one-to-one correspondence between 
Landau quasiparticles and free particles, the theory is said to possess 
adiabatic continuity. 

Notice how different the Landau quasiparticles are to our field the¬ 
ory idea of quasiparticles. In field theory the ground state contains 
no quasiparticles; in Fermi liquid theory it contains as many Landau 
quasiparticles as we had electrons in the Fermi gas, guaranteeing charge 
conservation, provided Landau quasiparticles carry the same charge as 
electrons. Landau quasiparticles do not, therefore, rely on the renor¬ 
malization constant Z p to guarantee their existence as our field theory 
quasiparticles do. 

An insight comes when, instead of starting with a ground state Fermi 
gas, we start with a ground state Fermi gas with an additional electron 
added with \p'\ > pp and, again, turn on the interaction. We would hope 
that the excited electron evolves into a quasiparticle in a momentum 
eigenstate | p'). However, unlike the electrons within the Fermi sea, the 
extra electron potentially has rather a lot of phase space to explore as we 
turn on the interactions. Through interactions with the other electrons 
it may therefore have a considerable amplitude to scatter out of its state 
|p') and so the excited quasiparticle will have a finite lifetime ( 2 r p /) _1 
against scattering into other states and decaying. 16 The phase space for 
this scattering, restricted by the Pauli principle, is such that those states 
near the Fermi momentum have the smallest probabilities for scattering 
and therefore survive for longest in quasiparticle momentum states. This 
is examined in the exercises with the result that the decay rate varies as 
F p ~ (|p| — pf) 2 - We conclude that really it is meaningful to speak of 
the properties of a Landau quasiparticle only near the Fermi surface in 
a metal where T p is small. 

Finally, if we lift a Landau quasiparticle out of a level below pp in 
the ground state and promote it to a level with momentum A p above 
Pf, we simultaneously create an unoccupied quasiparticle state or hole 17 
with momentum A p below pp. Since A p = \p\ — pp, the quasielectrons 
and quasiholes will both have decay rates r p oc (jp| — pp) 2 , so it is only 
meaningful to discuss their properties if their momenta are close to pp. 

To summarize the philosophy of the Fermi liquid: 

The Fermi liquid ground state contains Landau quasiparticles stacked 
up to the Fermi level. Landau quasiparticles have the same charge as 

ordinary electrons but their mass takes on a renormalized value m*. 

A quasiparticle excitation outside the Fermi sea (or, equivalently, a 
quasilrole within the sea) will have a decay rate T p oc (|p| — pf) 2 - 


A cartoon of the Landau Fermi liquid theory is shown in Fig. 31.8. 
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Where the Fermi liquid and quantum field theories coincide is in the 
description of the weakly excited states of the metal. The excited Landau 
quasiparticles have exactly the same properties as our quantum field 
theory quasiparticles and the argument about their lifetime applies to 
both. Near the Fermi surface the energy of an excited quasiparticle may 
be written E p ss ^{\p\ — pf) and its decay rate T p oc (|p| — pf) 2 We 
see that for small (|p| — pp) the real part of the quasiparticle energy 
will be larger than the imaginary part, or E p > r p , and a meaningful 
quasiparticle exists in the metal. 

We now complete our description of the Landau Fermi liquid with 
a discussion of the total energy of a weakly excited state of a metal. 
In the spirit of many of Landau’s arguments, in which any ignorance 
of microscopic details is no barrier to formulating a meaningful power 
series expansion, we write the energy of the Fermi liquid in the limit of 
a low density of quasiparticles as 

e = e s + J2( e p ] - Sn p + \ E fpp' Sn p 6 u p' + • • • ’ ( 3L2 °) 

V pp' 

where <bi p = n p — n p ' > is the difference between a distribution with exci¬ 
tations and the ground state distribution 18 and E g = J2 P E p \. p ^ is the 
energy of the ground state. The term linear in 5n p describes the excita¬ 
tion of isolated quasiparticles of energy E p 1 — p = dE/dn p . The second- 
order coefficient 19 f pp > = d 2 E/dn p dn p ’ describes the contribution from 
quasiparticle-quasiparticle scattering (i.e. from interactions), and can 
be described via an interaction Hamiltonian Ti\ = \ \2 pp , f pp ’ Sn p 5n p '. 


Example 31.4 

To understand the scattering process in more detail, one should include spin and 
this amounts to replacing f pp / by /p^y^/. One can show 20 that for spin-conserving 
interactions this quantity can be written as 

fpcr;p'a' = f S (cOS0) + / a (cOS 0)cT • Cr' , (31.21) 

where cos 6 = ■ The functions f s and / a can be expanded in terms of Legendre 

polynomials so that 

, oo 

f s,a (cos9) = —— J2 p i(oo S e) F ^, (31.22) 

9( e f) fr'o 

where EJ and Ff are Landau parameters and g(Ep) is the density of states at 
the Fermi level. 21 These expressions allow some of the key properties of the Landau 
Fermi liquid to be deduced. For example, one can show that the effective mass is 
ra* = m(l-\-F and the spin susceptibility is \ = M0Mb#(-^f)/(1 + -Tq ). Many of the 
predictions of Landau’s Fermi liquid theory may be shown (after much hard graft!) 
to coincide with those of quantum field theory and, owing to its comparative ease of 
use, it is still regarded as the best way of understanding the results of experiment on 
numerous condensed matter systems. 



Fig. 31.8 A cartoon of the Landau 
Fermi liquid, (a) The non-interacting 
Fermi gas with a single electron pro¬ 
moted into an excited state (indicated 
as a dotted line), (b) In the Landau 
Fermi liquid the interactions are turned 
on so that the electrons become quasi¬ 
particles and the holes become quasi¬ 
holes. There is a one-to-one corre¬ 
spondence between the quasiparticles 
and quasiholes in the interacting liquid 
and the electrons and holes in the non¬ 
interacting picture. 

^This implies that it is neither n p nor 
rip 0 ) that is the crucial quantity, but 
rather it is their difference Sn p , which 
tells us the number of excitations in the 
excited state. This is fortunate since we 
know neither n p nor n p°) with much ac¬ 
curacy, but we can find Sn p . 

^The partial differentials in the ex¬ 
pressions for _Ep°) — /l and f pp t are eval¬ 
uated about the ground state configu¬ 
rations, i.e. a frozen Fermi sea. 

20 

Further details and a fuller account 
may be found in the book by P. Cole¬ 
man. 


21 See Chapter 43. 


We have seen in this chapter that renormalization is an essential step 
in understanding what happens to particles in an interacting quantum 
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Fig. 31.9 The process of doing quan¬ 
tum field theory includes the necessary 
step of renormalizing. 


field theory. In fact, as shown in Fig. 31.9, we could summarize the steps 
to make a working theory as: (i) write a Lagrangian; (ii) quantize the 
free part; (iii) derive Feynman rules for interactions; (iv) renormalize. 
However, renormalization is often presented as a method of eliminating 
nonsensical infinities from theories. The elimination of these infinities is 
the subject of the next chapter. It should be remembered throughout 
that renormalization is necessary for all interacting theories, indepen¬ 
dent of whether or not we have infinities. 

Chapter summary 

• Turning on the interactions in a theory changes the properties of 
the particles. We say that they dress themselves in interactions. 

• The interacting propagator is 

G(p) = ——S-b (multiparticle parts), 

p z — nip + le 

where Z p is the quasiparticle weight 

• In Landau’s Fermi liquid theory there is a one-to-one correspon¬ 
dence between electrons in the non-interacting theory and quasi¬ 
particles in the interacting theory. The effect of interactions is to 
renormalize physical quantities, such as the effective mass, via the 
Landau parameters Ff and F'f. 


Exercises 

(31.1) Verify that, for fermions, 

n v = — ^lim G(p,t). (31.23) 

(31.2) Consider an electron with energy Ei > Ep scatter¬ 
ing with an electron with energy E 2 < Ep at T = 0. 
In order for this to occur we must have final elec¬ 
tron state E 3 > Ep and E 4 > Ep. 

(a) Show that this implies that the lifetime of an 
electron with E\ = Ep is infinite. 

(b) If Ei is a little different to Ep, why does the 
scattering rate go as (Ei — Ep) 2 ? 

(c) For T ^ 0 argue that we expect a scattering 


rate - = a(Ei — Ep ) 2 + b(k^T ) 2 , where a and b are 
constants. 

(31.3) (a) Using the relation 

—-— = — — i7r<S(a:o)j (31.24) 

xo + ie xo 

(discussed in Appendix B) identify the delta func¬ 
tion part of the propagator E _^ +ie ■ 

(b) For the propagator E _^\ ir , show that the 
full width at half maximum of the peak is given by 

2 r p . 













Renormalization: the 
problem and its solution 



It turns out that the obvious generalization of this idiotically 
simple manipulation gets rid of all of the infinities for any 
field theory with polynomial interactions, to any order in 
perturbation theory. 

Sidney Coleman (1937-2007) 

In the last chapter, we saw how interactions dress a particle and change 
its propagator. The dressed particles were quite different to the bare par¬ 
ticles that are revealed after canonical quantization of the free theory. 
This might mean that our perturbation theory, which involved expan¬ 
sions in the bare mass m and coupling constant A, is at risk. It is: it 
turns out that we’ve been doing our perturbation theory about the wrong 
masses and coupling constants! The solution to this problem involves 
making a shift so that we’re expanding in physical masses and coupling 
constants. The procedure also has a bonus. It solves a big problem in 
our perturbation theory: the problem of divergent amplitudes. 

In this chapter we approach the problem by starting with the diver¬ 
gences in our perturbation theory. We will look at why we obtain infini¬ 
ties in our calculations and suggest an idiotically simple mathematical 
workaround. The physical content of this workaround will turn out to 
shift the parameters of the theory from the wrong ones (m and A) to 
the right ones (mp and Ap). 
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32.1 The problem is divergences 


We’ve looked at calculating amplitudes and propagators for several in¬ 
teracting theories. Perhaps you’re curious (or suspicious) as to why we 
haven’t done the integrals at the end of the calculations. The reason is 
that many of them diverge. Understanding the source of the divergences 
relies purely on the following facts (with a finite and positive): 



,n+l 


n + 1 

[lnx]“ diverges 

_-m+1 i 00 


diverges for n > 0, 


— m + 1 


7 — m+1 


(32.1) 

(32.2) 


m — 1 


for m > 1. (32.3) 
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In the first integral the divergence arises because there are more powers 
of x in the numerator than the denominator. The second integral, where 
the number of powers of x is the same on top and bottom, is called log¬ 
arithmically divergent, for rather obvious reasons. The last example 
is convergent. 

Let us return to q i 4 theory, which is described by a Lagrangian 

£=^) 2 -^ 2 -^ 4 . (32.4) 

We ended our analysis of this theory in Chapter 19 with a perturbation 
expansion encoded in Feynman diagrams. Consider the amplitude for 
two-particle scattering, which is equal to the sum of all connected, am¬ 
putated Feynman diagrams with four external legs. Up to second order 
in the expansion (i.e. drawing all diagrams with one or two interaction 
vertices) we obtain the diagrams shown in Fig. 32.1. The main message 
of this section is that the diagrams with loops in them [(b),(c) and (d)] 
are divergent. This is to say, with the upper limit of the integral set 
to infinity we get infinite and therefore nonsensical answers. This is a 
disaster. 



Fig. 32.1 The expansion of four- 
point diagrams for two-particle scatter¬ 
ing up to second order in the interaction 
strength. Diagrams (b)-(d) define the 
processes labelled s, t and u, respec¬ 
tively. 


Example 32.1 

Let’s see how this unfolds. We’ve encountered the first diagram before. The result is 

iAd a = -iA. (32.5) 

For the subsequent diagrams, the integral we’ll need is 

d 4 q i i /A\ 

Jo (27 r ) 4 q 2 — m 2 + ie (p — q ) 2 — m 2 + i€ \ p ) 

where a is some numerical constant whose exact value isn’t important to us. Notice 
that the limits of this integral are zero and A. The parameter A is a large momentum 
cut-off, and we want to send A —» oo. By counting powers we see that there are four 
powers of momentum on top and four below, telling us that the integral ~ f is 
logarithmically divergent. In fact, for the diagrams in Figs. 32.1(b)—(d) we obtain 
the amplitudes: 

iAdb = i^A 2 {in A 2 — In [(pi + P 2 ) 2 ] } = iaA 2 {in A 2 — In s} , (32.7) 


iAd c = iaA 2 {in A 2 — In [(pi — P3) 2 ] } = iaA 2 {in A 2 — lnt} , (32.8) 


iAdd = iaA 2 {in A 2 — In [(pi — P4) 2 ] } = iaA 2 {in A 2 — In a} . (32.9) 

All of these tend to 00 as A —» 00 . The total amplitude for two-particle scattering is 
given by the sum of these diagrams, giving 

iAd = = i (Ad a T Adb T Ad c T Add) 

= — iA + iaA 2 {3 In A 2 — Ins — lnt — In a} , (32.10) 

and so we need to find a way to tame the iaA 2 {3 In A 2 } oc In A term. 
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We can avoid the divergence in these integrals if we make the pragmatic 
choice to only integrate up to a large, but finite, value of the cut-off A. 
This amounts to deliberately ignoring any fine-scale details in the fields 
below a length scale ss 1/A. This is frequently what is done in condensed 
matter physics, where we usually ignore all detail smaller than the size of 
an atom. If we apply our theory to fundamental particles it’s not quite 
so obvious why there should be a smallest scale but it might represent 
some graininess in spacetime. 1 

For now, let’s see if we can live (uncomfortably) with A oo. In¬ 
troducing finite A has an immediate consequence: the amplitudes we 
calculate using Feynman diagrams will depend on A. This is a serious 
problem: if a prediction for something we want to measure depends on 
A then we have an arbitrary constant which will need to be given some 
value. One strategy to avoid this is to add terms to the Lagrangian that, 
when we do the perturbation expansion up to some order (second order 
say), will remove the dependence of amplitudes on A. Doing this will 
clean up the theory up to second order. Although we should still expect 
the higher order terms to diverge (i.e. blow up as A —> oo), we’ll at least 
have healed the theory to the extent that we can obtain a prediction of 
physical behaviour that is valid up to second order. 


1 Actually, the idea of a length scale like 
this emerges very naturally from a way 
of looking at the Universe called the 
renormalization group, which we dis¬ 
cuss in Chapter 34. 


32.2 The solution is counterterms 


The terms that are added to the Lagrangian to remove the dependence 
on A, and hence the divergences, are called counterterms. We saw 
that for cf > 4 theory at second order the divergent part of the amplitude 
looked like 6iaA 2 In A; we therefore change the Lagrangian by adding a 
counterterm — (6aA 2 In A)</> 4 /4! thus: 






We now have a Lagrangian 


£=\(d„<j>) 2 





(32.11) 


(32.12) 


where the counterterm C ^ = —6aA 2 In A and we write the superscript 
(2) to remind ourselves that we’re only getting rid of divergences at 
second order. 

Now we start again with our new Lagrangian. We canonically quan¬ 
tize, use the Dyson expansion and figure out the Feynman rules and then 
sum the new diagrams. With the counterterm, the Dyson expansion of 
the S’-operator is now just 

S = Te~' 1 S Aix * t(^ 4 -c (2) <? 4 ). ( 32 . 13 ) 


Every order of the expansion therefore also includes a contribution from 
the counterterm, whose Feynman diagram is shown in Fig. 32.2. Since 
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Fig. 32.2 The C-counterterm. 


the counterterm is proportional to <^ 4 it has the same behaviour as the 
term. It therefore represents a vertex which, instead of carrying a 
factor — iA carries a factor i C^ 2 \ 

We calculate amplitudes to second order in A and (we only need to) 
include one diagram involving a counterterm and we obtain 

i M {2) = -iA + iaA 2 [3 In A 2 - Ins - Inf - lnu] + iC (2) , (32.14) 

and substituting C^ = —6aA 2 In A yields 

iA4 (21 = — iA — iaA 2 (Ins + Ini + In a). (32.15) 

This amplitude depends on the momenta of the incoming and outgoing 
particles (which is no problem) but crucially it doesn’t depend on A. 
We’ve done what we set out to do! Predictions for the theory will now 
not depend on an arbitrary cut-off A, and hence won’t yield infinity 
when we send A —> oo. 


32.3 How to tame an integral 

We now have a plan, which is to start calculating using the Lagrangian 
and then add a counterterm when we see a divergence. We then work 
out the coefficient C^ in front of the counterterm and all is set. Quite 
apart from the as yet unexamined physical consequences of this extra 
term, we might worry that this process will turn out to be ultimately 
futile. What if you calculate to second order as above, but at third order 
not only do we need a counterterm C^<f> 4 but also a new counterterm 
D^(f> e to swallow up some extra divergent term that emerges? What if 
it gets even worse and, at 27th order in the expansion, we need to add 
513 new types of counterterm to cancel all of the divergences and then, 
at 28th order, we find we need 752 new counterterms? A theory might 
never stop absorbing new types of counterterm! 

Amazingly, it often turns out that we only need to add a small number 
of types of counterterm (three, for example, in QED) and that these 
cancel divergences to all orders of perturbation theory! We still need to 
know the right constants (like C^ n ’) for each order of perturbation theory 
in order to cancel the divergences, but these will turn out to be numbers 
that we can calculate. Theories that have the property that only a finite 
number of counterterms are needed to cancel all divergences are said to 
be renormalizable theories. This is the miracle of renormalization. 

p — k — q 

P \ 7 P Example 32.2 

k To get a feel for this we’ll look at the Saturn diagram, shown in Fig. 32.3. Written 

out in full the amplitude is 

(—iA) 2 r A d 4 ^ d 4 /c i i i 

Fig. 32.3 The Saturn diagram. 6 J 0 (2-7t) 4 (2n) 4 q 2 - m 2 + ie k 2 - m 2 + ie (p - q - k) 2 - m 2 + ie 

(32.16) 
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These integrals are getting worse! If we’re going to systematically renormalize any 
theory we’ll need a system. The system involves writing the integral as a polynomial. 
Usefully there’s a key fact that will help us: Feynman diagrams can be expanded as a 
Taylor series in the external momentum. The polynomial representing the diagram 
will have divergent coefficients, reflecting the divergence of the integral. We’ll pick 
our counterterms to cancel the divergences in the coefficients. Happily, this results 
in counterterms of the right type to cancel the divergences in a particular integral. 

The integral in eqn 32.16 may be written in a series about p = 0 as 


I = a + 0p 2 + 7p 4 + ... , 


(32.17) 


where we don’t have any odd terms because of the symmetry <p = — </> in the original 
Lagrangian. 

Here’s how to find the counterterms: 

• The Saturn diagram integral is quadratically divergent. It has eight powers of 
momentum on top and six on the bottom. If we set the external momentum 
p equal to zero in our expansion then I = a and we conclude that a di¬ 
verges quadratically. This tells us that we need one counterterm that diverges 
quadratically. 

• If we differentiate the Saturn integral twice with respect to p then it becomes 
logarithmically divergent, since we have reduced the number of momenta on 
top by two. If we differentiate the series expansion I twice and set p = 0 we 
get I" = 2/3 and we conclude that (3 diverges logarithmically. We therefore 
need another counterterm which is logarithmically dependent on the cut-off 
which will have a Feynman rule that it appears multiplied by p 2 . 


• Differentiating the integral two more times makes it convergent so we don’t 
need any more counterterms. 

We need two counterterms to cancel the divergences in the Saturn integral I: one 
with coefficient A quadratically dependent on A and one with coefficient B which is 
logarithmically dependent. How do we make sure that the B counterterm will give 
a Feynman amplitude that is multiplied by p 2 and that A will not be multiplied by 
anything? The answer is to introduce counterterms into the Lagrangian that look 
like 

-Cct = ^0 2 +|(a M 0) 2 , (32.18) 

remembering that in momentum space [<9^</>(#)] 2 —* 4 ) {—p)p 2 4 > {p) we have 

B(d^(j)) 2 —» p 2 B and A —> A. Adding the counterterms gives the full Lagrangian for 
renormalized </> 4 theory: 



£ = \(d^ - ^ ^ + f (d»<P) 2 + (32.19) 

Notice that all of the counterterms have the same form as the terms in the original 
equation; they just have different coefficients. 

These counterterms all have Feynman diagrams and rules attached to them. We 
saw before that the -counterterm diagram corresponds to a rule i C^ n \ The B 
counterterm makes a contribution i(p 2 B^), where p is the momentum along the line 
and the A counterterm contributes i A ^. The A and B terms are usually represented 
by the single diagram shown in Fig. 32.4, which has a rule i (B^p 2 + A ( n )). 


Fig. 32.4 The A and B counterterm 
diagram 


To summarize, we have the Feynman rules for renormalized </> 4 theory 
given in the box. These eliminate all infinities. 
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2 Note that <j> r {x) are known as renor¬ 
malized fields. We assume Z is inde¬ 
pendent of three-momentum here. 


Feynman rules for renormalized (/> 4 theory 

• A factor p2 l|||J|| |for each propagator. 

• A factor —iA for each interaction. 

• Add sufficient counterterm diagrams to cancel all infinities. 

• A factor i (lT n 'p 2 + AA n ') for each counterterm propagator, 
where n is the order of the diagram. 

• A factor ifor each interaction counterterm, where n is the 
order of the diagram. 

• All other rules regarding integrating, symmetry factors and over¬ 
all energy momentum conserving delta functions are identical as 
for the case of unrenornralized perturbation theory. 


32.4 What counterterms mean 

We now turn to the question of the physics behind the counterterms. 
We’ve added terms to the Lagrangian which surely alters the physics 
we’re studying. What we are going to find is that the counterterms shift 
the parameters from fictional ones to the real-life ones, simultaneously 
removing infinities and forcing our theory to describe real life. To recap: 
we started with an interacting theory, 

£ = ^0) 2 -^ 2 -^ 4 , (32-20) 

with the mass and coupling constant m and A respectively. We’ll call 
eqn 32.20 the unrenornralized Lagrangian. The excitations made by the 
field operators when the coupling A = 0 have mass to. However, when 
we turn on the coupling this Lagrangian gives rise to revolting infinities 
unless we cut off the integrals at some momentum A. 

To fix the divergence encountered as A —> oo we find we need to renor¬ 
malize the theory. As described in the previous chapter, renormalization 
involves accepting that we’re dealing with dressed particles, which have 
a mass top and coupling Ap. These particles only involve a fraction 
Z? of the fields. This realization doesn’t remove the infinities, however. 
To do that we need to include counterterms in our Lagrangian. The 
renormalized Lagrangian is 

r' — I/O As ~\ 2 TO P j.2 'V /4 

£ ~ 2 4 | 

+ ^(^^ r ) 2 + (32.21) 

where we have rescaled the fields using 2 </> = \[Z(j) r (x) and called the 
mass top and coupling constant Ap respectively. This theory won’t give 
us nonsensical, infinite results for A —> oo. 
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Example 32.3 

To see the consequence of renormalization, we now collect the coefficients of the 
renormalized Lagrangian 

r XJrB (P>A \2 ( m p- A ),2 (*P-C) ,4 

C = ——(cb<Pr)--- <p T - - - <t> T , (32.22) 


“ r 4 , rn 

which suggests that the counterterms represent shifts in the parameters in the La¬ 
grangian. Writing —A = Sm 2 , —C = <5 A and L? = 8Z we have 

,5m 2 ) 2 (A P + <5A) 4 


£ = 


1 + 5Z ira .\2 ( m P 

(d^y- 


(32.23) 


2Z ' f " / 2Z T 4!Z 2 

where we’ve restored the original, unrenormalized fields. From this, we read off that 
we can relate our original and renormalized Lagrangians through 

(mp+6m 2 ) ^ _ (Ap-hSA) (32.24) 


4> = \/Z(f) T , Z = 1 + 5Z, 


This tells us that renormalization is simply an exercise in shifting parameters. 

To truly see what’s going on it may make more sense now to replay the argument 
backwards. We start with the unrenormalized Lagrangian and shift from parameters 
m and A to parameters mp and Ap. We start by renormalizing the fields to obtain 

C = 


We then use the shifts 
Z = 1 

giving us 


-6Z, 


1 , 


£ = ^(^r) - 


1,~ f\2 m ,2 A ,4 

2^0) - ~Y<t> - J'<t> 

Z, _ , 9 Zm 2 9 Z 2 A a 

(32.25) 

Zm? = mp + 8m 2 , Z 2 A = Ap + <5A, 

(32.26) 

jA i /o , \2 ^m ±2 ^A ,4 

“ 4T^ r + T ( ^ r) “l~^“4! 0r ’ 

(32.27) 


allowing us to identify the counterterms A, B and C as the shifts in the parameters 
—5m 2 , 5Z and —5A. 


We started the whole venture of quantum field theory by writing down 
a simple Lagrangian with parameters we thought would tell us about 
the masses and couplings of real particles in Nature. It turns out we 
were wrong. We were doing the wrong perturbation theory expanding a 
series in terms of the wrong constant mass m and wrong coupling A. We 
therefore had asked a nonsensical question and got some infinite (and 
therefore nonsensical) answers. Actually we should have been expanding 
our series in terms of a mass mp and coupling Ap. The price we pay 
for making this shift from wrong to right variables is counterterms. In a 
sense, we’re not really ‘adding counterterms’ at all; we’re really taking a 
bare Lagrangian and making a shift of variables from wrong ones to the 
right ones and the shifts required are the counterterms. Renormalization 
is not, therefore, an exercise in hiding infinities, it’s an exercise in making 
a theory describe real life. For <(> 4 theory, the counterterms are divergent, 
meaning that these shifts may be infinite! This implies that m and A, 
the bare mass and coupling are infinite quantities which are shifted by 
infinite amounts upon dressing themselves by interactions. 3 

There’s one last, but very important, point to make. What number 
do we take for mp and Ap? What are the right masses and coupling 
we should expand around? The answer is that they’re the ones Na¬ 
ture’s given us! For this reason, mp and Ap are known as the ‘physical’ 
parameters, in contrast to the ‘bare’ parameters m and A. 


Q 

This explains why it’s often claimed 
that the bare charge of the electron is 
infinite. The spontaneous appearance 
of electron-positron pairs screens the 
charge leading to the finite charge we 
encounter in Nature. 
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32.5 Making renormalization even simpler 


4 The clumsy name, ‘superficial’ degree 
of divergence, arises because the actual 
degree of divergence can be complicated 
in gauge theories and some pathological 
cases, see Peskin and Schroeder, page 
316. 


We’d like a method to avoid having to analyse each diagram separately 
as we did for the Saturn diagram above. Is there a method which can, 
once and for all, tell us how divergent a diagram or class of diagrams is? 
There is and it’s based on simple dimensional analysis. 

Define the superficial degree of divergence 4 of an integral arising from 
a Feynman diagram in </> 4 theory to be 

D f Powers of momentum \ / Powers of momentum \ 

l in numerator J \ in denominator J 

(32.28) 

If D > 0 the integrals diverge, if D = 0 they logarithmically diverge and 
if D < 0 they don’t diverge. For a diagram with Be external lines, the 
degree of divergence in </> 4 is given by D = 4 — Be- 


Example 32.4 

We can prove the above theorem for predicting whether diagrams diverge. In addition 
to the previously defined symbols 

• V is the number of vertices, 

• L is the number of loops, 

• B\ is the number of internal lines. 

Each loop brings with it an integral with four powers of momentum. Each internal 
line brings a propagator which brings —2 powers of momentum. Therefore 

D = 4L — 2B\. (32.29) 

To get the number of loops: L is the number of momenta we integrate over and 
each internal line gives a momentum. The number of loops is less than B\ though. 
Momentum-conserving delta functions eat up integrals and there are V of them (one 
for each vertex, since we must conserve momentum at every vertex). However, one 
of the delta functions conserves momentum for the entire diagram, so doesn’t eat an 
integral. The number of loops L is therefore equal to the number of internal lines 
Bi, minus (V — 1) the number of vertices, with one removed for overall momentum 
conservation: 

L = Bi — (V — 1). (32.30) 

Each vertex has four lines coming out of it. Each external line comes out of one 
vertex and each internal line connects two vertices. Therefore 


4V = B e + 2B t . (32.31) 

Inserting eqn 32.30 and eqn 32.31 into eqn 32.29 we get D = 4 — B E and the theorem 
is proved. 


The great significance of this theorem is that it shows that the three 
counterterms we have identified are the only ones that will ever appear. 
The fact that D = 4—Be means that the only diagrams that diverge (i.e. 
with D > 0) have < 4 external legs. There are no allowable diagrams 
with one or three legs and we’ve considered the divergences for the two 
and four leg diagrams. We’ll never need any more. 
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32.6 Which theories are renormalizable? 

Renormalizable theories are those in which a finite number of countert¬ 
erms cancel all divergences. Unfortunately, there are theories for which 
this property does not hold true. An example is Fermi’s theory of weak 
interactions between fermions, whose Lagrangian is given by 

C = — m)ip + (32.32) 

where if? is the fermion field. 5 The superficial degree of divergence of this 
theory is given by 

D = 4 - ^F e + 2V, (32.33) 

where F E is the number of external Fermi lines in the diagram. This has 
the unfortunate feature that D depends on V, the number of interaction 
vertices. If we consider fermion-fermion scattering, for which F E = 
4 then for V > 1 we have divergent diagrams, which become more 
divergent as V gets larger. We would need new counterterms at every 
order of perturbation theory. 

In fact, whether a theory is renormalizable or not may be read off 
simply from the dimensions of the interaction coupling constant. The 
rules are 

• A super-renormalizable theory has only a finite number of superfi¬ 
cially divergent diagrams. Its coupling constant has positive mass 
dimension. 

• A renormalizable theory has a finite number of superficially diver¬ 
gent diagrams, however, divergences occur at all orders of pertur¬ 
bation theory. Its coupling constant is dimensionless. 

• A non-renormalizable theory has diagrams that are all divergent 
at a sufficiently high order of perturbation theory. These theories 
have a negative mass dimension. 

In our units the action S = f d 4 a: C must be dimensionless since e lS 
appears in the path integral. We therefore need C to have dimensions 
[Length] -4 . Also note that, using these units, mass and energy have 
dimensions [Length] -1 . Our 0 4 theory has a Lagrangian C = |(d /i <^) 2 — 
^cj) 2 — j)</> 4 from which we conclude that [<j>] = [Mass] = [Length] -1 and 
A is dimensionless. The ()> 4 theory is therefore renormalizable. On the 
other hand, looking at the mass term in Fermi’s theory we can conclude 
that [■;/’] = [Mass ]2 and that, therefore, [G] = [Mass] -2 . The theory is 
non-renormalizable. 

The reason for the rule may be seen if we ask about the momentum 
dependence of scattering in Fermi’s theory for small external momenta. 
The lowest order contribution will vary as AIi ~ G. The next order 
is G 2 and so that the units agree with M\ we require M 2 ~ G 2 p 2 
or ~ G 2 A , when we integrate. This is divergent when we send A —» 
00 . Things only get worse as we go to higher orders. This increase in 
powers of momentum will occur for any theory with a coupling which has 


5 

The meaning of the cross through the 
momentum operator p and the bar over 
xj> will be explained in Chapter 36. 
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negative mass dimension. The disaster clearly doesn’t occur for theories 
with dimensionless couplings, where no extra momentum dependence is 
required. For super-renormalizable theories, we need momentum factors 
raised to negative powers, improving convergence at large momenta. 

Finally we note that this argument also allows us to see that non- 
renormalizable interactions in Fermi’s theory only cause trouble when 
A is larger than « G _1 . That is, if we’re interested in physics at much 
lower energies than G^ 1 then we never explore momenta between G -1 
and infinity and so the presence of the non-renormalizable term in the 
Lagrangian wouldn’t cause us trouble. It may be that all of the theories 
of Nature that, at one time, we believed to be renormalizable (including 
quantum electrodynamics and the electroweak theory) actually contain 
non-renormalizable terms in their true Lagrangians. It’s just that all of 
our experience is limited to energies where we don’t notice them since 
their coupling constants Gj are so small that our experiments cannot 
reach the energies of G ~ 1 . In this way of looking at the world, our theo¬ 
ries of Nature are low-energy, effective theories, which will eventually 
break down at high enough energies. The true theory of Nature may 
not, therefore, be a quantum field theory at all... 


Chapter summary 

• Quantum field theory contains divergences and the solution to the 
problem is renormalization. Counterterms can be added to the 
Lagrangian to remove the divergences and these counterterms have 
the effect of shifting parameters in the original Lagrangian. 


Exercises 

(32.1) Consider a theory described by the Lagrangian 
r = i(S M 0) 2 -(m 2 /2)^-( 5 /3!)0 3 . 

(a) Write a renormalized Lagrangian and determine 
the relationships between the bare and renormal¬ 
ized parameters. 

(b) Working in (5 + l)-dimensional spacetime, de¬ 
termine the superficial degree of divergence of the 
two diagrams shown in Fig. 32.5. 

(c) Suggest Feynman rules for the counterterms in 
the renormalized theory. 



Fig. 32.5 Two divergent diagrams occurring in the¬ 
ory. 




Renormalization in action: 
propagators and Feynman 
diagrams 



Now we are going to see yet another way in which the whole 
world is built out of recursion ... We are going to see that 
particles are - in a certain sense which can only be defined 
rigorously in relativistic quantum mechanics - nested inside 
each other in a way which can be described recursively, per¬ 
haps even by a sort of ‘grammar’. 

Douglas Hofstadter (1945- ), Godel, Escher, Bach 

Some of the most useful tools in quantum field theory are propagators 
and Feynman diagrams. We’ve seen in the previous two chapters how 
renormalization affects single particles and Lagrangians. Now we’ll ex¬ 
amine how renormalization affects Feynman diagrams and propagators. 
This approach provides a particularly vivid picture of how a particle 
dresses itself in interactions. The main features of the propagator ap¬ 
proach to renormalization is that the dressing-up process is implemented 
with two new Green’s functions. 1 The first results from the single particle 
interacting with the vacuum as it propagates through spacetime. These 
interactions gives rise to a Green’s function we call the self-energy. 
The self-energy describes the changes to the particle’s mass caused by 
interactions. The second new Green’s function comes from the virtual 
fluctuations screening the interactions between particles. The screening 
is described by the vertex function T. This screening changes the 
coupling constant of the theory. 
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^The beauty of describing renormaliza¬ 
tion in terms of Green’s functions is 
that each of these new Green’s func¬ 
tions may be written as a sum of Feyn¬ 
man diagrams. 


The notation is unfortunate, but please 
try not to confuse the vertex function 
r and the quasiparticle decay rate T p . 


33.1 How interactions change the 

propagator in perturbation theory 

We will discuss the usual example of a scalar field theory with </> 4 inter¬ 
actions, described by a Lagrangian 

£ = ^) 2 - - ^ 4 , (33.1) 

where we’ve written the theory in terms of bare fields and parameters. 

When there are no interactions present, the amplitude for a particle 
with momentum p to propagate between two points is given by Go(p) = 
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(a) (b) 

JUL 

-e- 

- 8 - V 

Fig. 33.1 (a) Some 1PI diagrams from 
</> 4 theory. They can’t be turned into 
two meaningful diagrams by cutting 
one propagator line, (b) Diagrams that 
aren’t 1PI. All can be turned into legit¬ 
imate 1PI diagrams by cutting at some 
point along the horizontal line. 


2 

Note that the 1PI self-energy does not 
include an overall energy-momentum 
conserving 5-function. We also ampu¬ 
tate the external lines, that is, we don’t 
include propagators for these lines. 

— 

= AL + 

+ - 0 - + 

+ -§- + 

H- 



Fig. 33.2 Some contributions to the 
1PI self-energy — iS (p) in 0 4 theory. 


i/(p 2 —m 2 +ie). We saw previously (without using perturbation theory at 
all) that when you turn on interactions particles become dressed particles 
described by a propagator 


G( P ) 


i Z n 


p 2 — mp, + ir. 


+ 


Multiparticle 

parts 


(33.2) 


1 /2 A 

where = (p>|</>(0)|n) tells you how much particle we have in the 
momentum state \p\), T p is the particle decay rate and mp is the phys¬ 
ical particle’s exact mass. Notice that the pole is found at p 2 = nip 
and the residue of the pole is i Z p . We now take a propagator for non¬ 
interacting particles with bare mass m and examine how it changes when 
we add an interaction to our theory as a perturbation. Since we know 
that the exact answer is given by eqn 33.2, we know how to extract the 
exact mass mp of the particle from any propagator we calculate with 
perturbation theory: we simply look for the pole. We also know how to 
find the quasiparticle weight Z p : we find the residue at the pole. We 
therefore have a very important rule: 

The physical mass of a particle in an interacting theory is given by 
the position of the pole in the propagator. The quasiparticle weight 
is found from the residue at the pole. 

In perturbation theory the propagator can be found by adding up Feyn¬ 
man diagrams. That is 


\ ' 


G(P) = ^ 


All connected Feynman diagrams 
with two external legs 


(33.3) 


One helpful way to think of the propagator is to picture it as being made 
up of the two external legs and everything we can slot in between those 
two legs. What we can slot in is called the self-energy, and we describe 
that using what is called a 1-part irreducible diagram (or 1PI di¬ 
agram for short), which is defined as a connected diagram which cant 
be disconnected by cutting one internal propagator line. Some examples 
of 1PI diagrams in g i 4 theory are shown in Fig. 33.1(a) and some exam¬ 
ples of diagrams that aren’t 1PI are shown in Fig. 33.1(b). These latter 
diagrams contain cutlines, and so fall apart into disconnected pieces if 
you attack one of the cutlines with a pair of scissors. In contrast, the 
connectedness of an 1PI diagram is immune to a single scissor attack on 
one of their internal lines. They essentially represent the smallest non¬ 
trivial Feynman diagram and a basic building block of more complex 
diagrams. They are, if you like, the guts of the self-energy. 

The sum of 1PI diagrams with two external legs forms a Green’s func¬ 
tion known as the 1PI self-energy E(p), given by 2 


-iS(p) = E 


All amputated 1PI diagrams 
with two external lines 


(33.4) 


Some contributions to —i£(p) in <fr 4 theory are shown in Fig. 33.2. In 
the following example, we will explore how the propagator G(p) can be 
expressed in terms of the 1PI self-energy £(p). 
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Example 33.1 

The definition of E leads to one of the neatest tricks in perturbation theory. We can 
write the interacting propagator as a series of encounters with an interaction repre¬ 
sented by E. This is the story of how the propagator dresses itself: it’s the amplitude 
for propagating with no interactions, added to the amplitude for propagation being 
interrupted by one interaction with E, added to the amplitude for two interactions 
with E, added to... The series, shown in Fig. 33.3, is written as follows: 





G(p) = 


- [-iE(p)l 

p z — m z l -i p z — m z 

b if:(p) ] 2 [~ i£(p) ] 


We now treat this as a geometric series and sum: 



Fig. 33.3 Diagrammatic version of 
(33.5) eqn 33.5. 


G(p) 


a * 2 {1 + f-iS(p)l 2 * a 

p z — m z f L J p z — m z 

+ [~ lS (P)] p 2 _ m 2 [“^(P)] p 2 _ , m 2 + • • } 


p 2 — m 2 — E(p) + ie 


(33.6) 


where, in the last line, we’ve reinserted the ie from the free propagator. The sum can, 
more amusingly, be done in terms of diagrams as shown in Fig. 33.4. Equation 33.6 is 
another instance of Dyson’s equation, which we met in Chapter 16. The form of the 
equation makes it look a lot like the free propagator, albeit with an extra self-energy 
term in the denominator. 

To find the mass-energy mp of the physical particles we just follow our renormal¬ 
ization rule: we look for the position of the pole. It is found using the equation 


p 2 — m 2 — Re ^E(p)^ = 0, 


(33.7) 


when p 2 = mp. That is to say that the physical mass of the particles is given by 


m p = m 2 + Re ^E (p 2 = mp)J , (33.8) 

which tells us that the real part of the self-energy tells us the shift in energy caused 
by interactions. The latter equation is known as a renormalization condition. 3 


33.2 The role of counterterms: 

renormalization conditions 



Fig. 33.4 Diagrammatic version of 
eqn 33.6 


3 

It allows us, at the level of Feynman 
diagrams and perturbation theory, to 
renormalize a theory. By the same to¬ 
ken, the decay rate of the physical par¬ 
ticle is found from Im (s(p 2 = m p)V 


But what of counterterms in this way of looking at things? Thus far 
we’ve summed all of the 1PI self-energy diagrams to make an object 
S(p). In the absence of counterterms, the role of the self-energy E(p) is 
to shift the mass from m to mp. However, we have ignored the fact that 
many of the diagrams that contribute to E(p) will be divergent, making 
the shift in mass seem dangerously infinite! One way to interpret this is 
to say that m must be infinite and requires an infinite shift to bring it 
down to the physical value mp. However all this talk of infinity should 



298 Renormalization in action: propagators and Feynman diagrams 


^In the theory of the electron gas, 
where we do not encounter divergences 
to the extent that we do in some other 
theories, the first option is the logical 
one. In quantum electrodynamics the 
second option is the one to choose. 



function is contained in the Greens 
function shown here, corresponding 
to (O|T0 4 5'|O)/(O|S'|O), or equivalently 
G(pi)G(ps)[-if(pi ,P2,P3 )]g(p 3 )G(P4). 
Diagrammatically, the vertex part —iF, 
the central blob in the diagram above, 
may be extracted by amputating the 
external propagator legs. 

5 

We could also define a 1PI vertex func¬ 
tion, which is useful for many applica¬ 
tions, but we won’t need it here. 

X 


Fig. 33.6 Some contributions to the 
vertex function —if in 0 4 theory. The 
vertex function is represented as a 
shaded blob with four external legs. 


make us nervous and we would be better to remove divergences as we 
did in the previous chapter. 

To remove the divergences in E(p) we should really sum all of the self¬ 
energy diagrams along with counterterms in order to cancel infinities. 
Recall that the effect of including counterterms was to shift the mass 
of the theory from the unphysical fictional value of m to the measured 
and reliable value mp. Using the 1PI self-energy E(p), we won’t need to 
shift the mass at all! That is, when we include counterterms we require 
the renormalization condition 

Re (E(p 2 = ml )) = 0. (33.9) 

We see that there are two ways of doing business. We may use the 
renormalization condition eqn 33.8, which neglects counterterms with 
the result that the shifts from fictional to real masses are potentially 
infinite. On the other hand we can use the renormalization condition in 
eqn 33.9 which allows us to start with the mass set to the correct value 
and we ensure that it doesn’t change as we do our perturbation theory. 4 


33.3 The vertex function 


The next thing to do is to work out how to renormalize the coupling 
constant. Physically, this is changed because vacuum fluctuations screen 
the interactions between two particles. The interaction vertex in <j > 4 
theory has the Feynman rule 

(Vertex) = (27 t) 4 5 (4) (?N + p 3 - P 2 ~ Pi)(-iA), (33.10) 


that is, we get a factor of —iA for every vertex and the 5-function ensures 
that momentum is conserved. The vertex is plugged in between four 
external lines to tell us about two particle scattering. 

We define the vertex function f for <j > 4 theory by 5 



All connected four-point diagrams 
with external legs amputated 


(33.11) 


This is shown diagrammatically in Fig. 33.5. (Note that —if does not 
include an overall energy-momentum conserving 5-function.) Like the 
< i i 4 interaction vertex, the vertex function may be plugged in between 
four propagators. It is designed to tell us how interactions (involving 
virtual particles) affect how real particles interact with each other. Some 
examples of contributions to — IT are shown in Fig. 33.6. 


Example 33.2 

For two-particle scattering F can be obtained from the amplitude for the two-particle 
scattering process. To first order in the coupling, the four-point vertex function is 
given by a single interaction vertex: - iF = —iA. To second order in the coupling 
we eliminated divergences using a counterterm (see Chapter 32) and saw that the 
two-particle scattering amplitude was given by 

— if (pi,P 2 ,P 3 ) = —iA — iaA 2 (Ins + Inf + Inu), 


(33.12) 



33.3 
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that is, it depends on the momenta of the incoming and outgoing particles 
[parametrized by s = (pi + p 2 ) 2 , t = (p\ — p;>,) 2 and u = (p 2 — P 3 ) 2 ]. 


The last example demonstrates that the exact vertex function 
r(pi,P 2 ,P 3 ) is a function of the incoming and outgoing momenta. (Of 
course momentum conservation means we only need give three of the 
four momenta in the problem.) In order to define a renormalization 
condition for T we need to choose the values of these momenta at which 
to fix T to some value. However, unlike mass which is unambiguously 
defined, there is no unique definition of the coupling constant. We are 
therefore free to choose a renormalization condition at our convenience. 
We finally write the definition of the physical coupling constant: 


( Amputated, connected diagrams with N 
four external legs 
and momenta p \, P 2 and P 3 y 

(33.13) 

where the values of pi, p 2 and p$ that we choose are called the renor¬ 
malization point. The sign is chosen so that to first order Ap = A. 


Example 33.3 

We can use the definition of Ap to calculate the amplitude for two-particle scattering 
up to second order in unrenormalized perturbation theory (i.e. where we haven’t used 
counterterms to remove the cut-off). If the physical coupling constant Ap is defined 
to be that measured at a renormalization point so, to, uo then, to second order, we 
have that 


In Chapter 41 we will use the analogous 
Green’s functions E and T to assess how 
electromagnetic interactions change the 
properties of photons and electrons in 
quantum electrodynamics. In Chap¬ 
ter 43 we will examine the renormaliza¬ 
tion of the theory of metals in terms of 
the self-energy of electrons and the pho¬ 
tons that mediate the Coulomb force 
that acts between them. 


— iAp = — if (so, to, ilq) = —iA + iaA 2 (3 In A 2 — lnso — Into — lntio) • (33.14) 


The two-particle scattering amplitude is given, for particles with momenta s, t and 
u, by 

LM = —iA + iaA 2 (3 In A 2 — In s — In t — In u) . (33.15) 

At the level of approximation to which we’re working we may use eqn 33.14 to write 


—iA = —iAp — iaAp (3 In A 2 — lnso — Into — lntto) + 0(A 3 ), 
which allows us to eliminate A (and A) from eqn 33.15 to give the answer 


i J\4 = —iAp — iaAp 


so 


In — + In - 


to 


I u 
- In ( — 


uoj J 


0( A 3 ). 


(33.16) 


(33.17) 


We see that we obtain the amplitude in terms of the physical coupling constant Ap, 
where momenta are measured relative to the renormalization point. This argument 
may be repeated using renormalized perturbation theory (i.e. with the use of coun¬ 
terterms, which eliminate all In A terms) to obtain an identical answer, but without 
the troubling business of carrying around the (potentially infinite) A-dependent term. 
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Chapter summary 


• The self-energy E can be related to the sum of all 1-part irreducible 
(1PI) diagrams with two external legs. 

• In <^> 4 theory, the vertex function T can be related to the sum of all 
four-point diagrams with external legs amputated. 


Exercises 


(33.1) To find the quasiparticle weight we can expand the (33.4) Consider the action derived for a one-dimensional 
self-energy in a Taylor series. We’ll do this about 
the interesting point p 2 = nip. 


lattice in (1 + l)-dimensional spacetime 

s=-^rf 


E(p 2 ) « S(nip) + (p 2 - top) 


(a) Using this expansion show that 

G( P ) «- 


4>- p (—u>)(uj 2 —u>o+ujq cospa) <t>p(co). 


(p 2 - ml) 


1 - 


dS(p 2 ) 

dp 2 


+ . . . 

m 2 

(33.18) 

. (33.19) 


(b) Use this to show, to the order to which we’re 
working, that 


1 + 


dS(p 2 

dp 2 


(33.20) 


_ _ .2 .2 , .2 

(33.21) 

We will treat the final term in the bracket as a per¬ 
turbation. 

(a) Identify the free propagator and the interaction 
vertex. 

(b) Write the full propagator as a sum to infinity 
involving the free propagator and interaction term. 

(c) Show that, on carrying out the sum, the ex¬ 
pected full propagator is recovered. 

(33.5) Consider a model of an atom made up of two energy 
levels (Ei > E\ > 0) occupied by a single fermion 
and described by a Hamiltonian 

H = i?ic|ci + E 2 & 2 C 2 + V (c|c2 + C 2 C 1 ). (33.22) 


This gives us a scheme for calculating Z, the quasi¬ 
particle weight, in terms of the quasiparticle self¬ 
energy. 

(33.2) (a) Draw diagrams showing the contributions to the 
self-energy —iE(p) of psions for ^>'^0 theory up to 
fourth order in the interaction. 

(b) Draw the contributions to the vertex function 
—if in 1 p^tpcj} theory up to third order in the inter¬ 
action. 

Here the vertex function includes all amputated in¬ 
sertions with one psion line, one antipsion line and 
one phion line. 

(33.3) Repeat the argument in Example 33.3 using renor¬ 
malized perturbation theory (i.e. including the use 
of counterterms to remove the momentum cut-off 
A). 


(a) Treat the non-diagonal term as a perturbation 
and find the free propagators (0|Tci(f)c}(t , )|0) and 
{0\Tc 2 (t)cl(t')\0) for electrons in level |1) and \2) 
respectively. 

(b) Now consider the interaction term. Find the 
Feynman rule for the processes that contribute to 
the self-energy and invent appropriate Feynman di¬ 
agrams to describe an electron in the system. 

Hint: the potential here is time independent, so the 
process to consider involves the electron changing 
states and immediately changing back. 

(c) Find the self-energy of the electron in each state 
and show that the interaction causes a shift in the 
energy of the particle in level 11) by an amount 


E2 — Ei 


(33.23) 




Exercises 301 


(d) Compare this result with (i) the exact solution 
and (ii) with second-order perturbation theory. 

*(e) Harder In the time-dependent version of the 
problem the interaction part is given by 

H ' = J 

(33.24) 

By evaluating the self-energy of an electron in each 
of the levels, show that the decay rates of the levels 


are given by 

214 = 2 -kV( uj 0 ) 2 (n{uJo)) j 5(q° - wo), 
214 = 27rC( Wo ) 2 [l + <n(wo))] J - 

where u>q = E 2 — E\. 


wo), 
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^The renormalization group is a bit of 
a misnomer as it is not really a group. 
The name arises from the study of 
how a system behaves under rescaling 
transformations and such transforma¬ 
tions do of course form a group. How¬ 
ever, the ‘blurring’ that occurs when 
we rescale and then integrate up to a 
cut-off, thereby removing the fine struc¬ 
ture (and this is the very essence of 
the renormalization group procedure) 
is not invertible (the fine details are 
lost and you can’t put them back). 
Thus the transformations consisting of 
rescaling and integrating up to a cut-off 
do not form a mathematical group be¬ 
cause the inverse transformation does 
not exist. 


Kenneth Wilson (1936-2013) was 
awarded the Nobel Prize in Physics in 
1982 in recognition of ‘his theory for 
critical phenomena in connection with 
phase transitions’, a theory that grew 
out of his work on the renormalization 
group. 


The renormalization group 


Everything is in motion. 

Everything Rows. 

Everything is vibrating. 

William Hazlitt (1778-1830) 

A question we keep asking in physics is ‘when is a theory valid’. In the 
context of quantum held theory an answer is provided by the philosophy 
of the renormalization group. 1 This is a way of looking at the world 
pioneered by several physicists, but most notably Kenneth Wilson. It 
allows us to make sense of why a renormalized quantum held theory 
describes Nature. 


34.1 The problem 

Theories are described by Lagrangians. These are the sum of several 
terms, each of which is some combination of helds and their derivatives, 
multiplied by a so-called coupling constant. For (/> 4 theory, we have the 
Lagrangian 

£ = ~ ( 34 - 1 ) 

where m and A are the coupling constants. Calculations that start with 
a Lagrangian like this soon encounter disaster in the form of divergent 
(i.e. infinite) integrals. That is, we take our Lagrangian, canonically 
quantize the free part, treat the interacting part as a perturbation to 
define an 5-operator using Dyson’s equation and then start working out 
amplitudes in the form of 5-matrix elements which lead to Feynman 
rules. If a Feynman diagram involves a loop, we often get a divergence. 
To cure a divergence, we can introduce a finite cut-off in momentum A 
in such a way that we integrate up to A rather than infinity. In the 
previous chapters our strategy was then to renormalize, that is to say 
that we remove the mention of A from things that we calculate. This 
involves expressing amplitudes in terms of physical coupling constants, 
which are the ones we measure at some agreed point in momentum space. 
Our definitions of coupling constants depend on this point in momentum 
space. 

Kenneth Wilson had another strategy: rather than hiding the cut-off, 
we live with it. In Wilson’s view, to properly define the theory, we need 
to admit that we do our integrals up to a maximum momentum A, which 
we are entirely free to choose. The arbitrary choice of A immediately 
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raises the question: how do we know what value to choose? One answer 
is that we want the length scale A -1 to be far smaller than the length 2 
scale p- 1 of any of the physics in which we’re interested. Consider, for 
example, a gas of atoms in a box. If we’re interested in sound waves, 
which are oscillations in the number density of the constituents of the 
gas involving many millions of atoms, then the scale in which we’re 
interested in p~ x is of the order of centimetres and we could take A -1 
to be a few microns. If we’re interested in the electron cloud of an atom 
in the gas, then we’re interested in a length scale p -1 of the order of 
the size of an atom and we could take A -1 to be the size of an atomic 
nucleus. The size of A -1 that we choose therefore changes depending on 
the scale p~ l of the physics that we’re analysing. 

The renormalization group is a machine that tells us how the predic¬ 
tions of the theory change as we alter the scale of interest. It gives us an 
equation telling us how a coupling ‘constant’ changes with the length or 
momentum scale in which we’re interested. To summarize this chapter: 

The goal of renormalization group analysis is to discover how the 
coupling constants change with the scale of interest. 


o 

Remember that in units where h = 
c = 1, mass m, energy E and momen¬ 
tum p have inverse units to length. 


Example 34.1 


In general, this goal may be achieved by examining how your Lagrangian behaves 
when the cut-off A is systematically varied. This approach will be outlined later. It is, 
however, sometimes possible to leapfrog the systematic method to get an equation for 
the coupling constant in terms of the scale of interest. We’ll follow such an approach 
here first. Recall how we defined the renormalized coupling constant in 0 4 theory. 
We said that Ap was the measured scattering amplitude when the particles had 
squared momenta so, to and uo, which is to say, the coupling constant is defined at 
a particular point in momentum space. How do we choose this point? A pragmatist 
would say that you choose it to be a convenient energy scale for the problem you’re 
addressing. If we’re dealing with electrons then we could set so = to = uo = p 2 where 
p is close to the electron mass. If we’re dealing with pions, we’d use p close to the 
pion mass and so on. However, this ‘idiot-proof’ scheme could easily be challenged 
if an ingenious idiot started thinking about electron physics but set p equal to the 
pion mass. 

As we saw in Chapter 33 the physical coupling constant is given to second order 


by 

—iAp (so, to, uo) = —iA + iaA 2 (3 In A 2 — In so — Into — lnuo) . (34.2) 

This allowed us to write the two-particle scattering amplitude to second order: 


i M = —iAp (p) + ia[Ap(^)] 2 


In 


-In 


-In 


(34.3) 


which is to say that it’s given by the physical coupling constant added to a logarithmic 
correction. This correction is small since we chose p 2 to be of order so, to and uo- 
If, on the other hand, we idiotically choose a renormalization point p' 2 to be much 
larger than so, to and uq , then the logarithmic term could be very large compared to 
the first term. Then we would have 


i M. = —iAp (//) + ia[Ap(//)] 2 


In 


- In 


- In 


(34.4) 


where the second term is large compared to the first. Subtracting, we can find an 
expression for Ap(//) in terms of Ap (p): 




A P (/i) + 6a[A P (/i)] 2 In 



(34.5) 
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Remember that [Mass] 1 
[Length] -1 . 


92 



Fig. 34.1 The renormalization group 
flow in a configuration space defined 
by two coupling constants g\ and (72 • 

^ Alternatively there is nothing stop¬ 
ping us examining the theory at short 
length scales in which case we could set 
6 < 1 . 

4 This is, of course, a different ‘Gell- 
Mann-Low equation’ to that described 
in Chapter 22. See M. Gell-Mann and 
F. Low, Phys. Rev. 84, 350 (1951) 


5 


Sometimes one can write 


m = 


d 9 
dfc 


instead. Since b divides A, the rescaling 
is more usefully modelled by looking at 
how g changes with In b. But, as we 
shall see in this chapter, different defi¬ 
nitions can, and are, made. 



Fig. 34.2 Four examples of flows in 
a configuration space defined by a sin¬ 
gle coupling constant g , visualized by 
plotting /3(g) = Arrows show the 
direction that the length scale £ —> 00 . 


which is expressed in differential form thus: 

/a — Ap(/r) = 6aAp(^t) 2 + O(Ap). (34.6) 

dg 

So in this case, we have a simple equation that tells us how the coupling A changes 
with the momentum scale g. 

If we are interested in high energies, we can examine how A behaves as g, rep¬ 
resenting the scale of interest, gets larger and larger, that is, as g —» 00 . We call 
this the ultraviolet behaviour. Equally well, we might be interested in the physics 
at very large length scale, which would involve asking how A changed as g —> 0. 
This is known as the infrared behaviour, since it corresponds to the limit of small 
energies. 


34.2 Flows in parameter space 

A theory is characterized by { 5 ;}, the set of coupling constants that 
measure the strength of the various interactions. (For (j) A theory we 
have 52 = m and <74 = A.) Each coupling constant is a function of the 
cut-off A, so we can write the ith coupling constant as <?*(A). We are 
going to examine what happens as we look at different length scales, 
which is achieved by changing the value of the cut-off. If we reduce the 
cut-off by a factor b , then we want to see how the coupling constants 
change and so we will look at the transformation 

Si(A) - ft(A/6). (34.7) 

If we are interested in the limit of large length scales (corresponding 
to what is measured in condensed matter physics) then we choose the 
direction of positive flow by fixing b > 1. This is known as coarse 
graining. 3 A theory can be described as a point in the multidimensional 
space formed by all of the { 5 ,}, i.e. by the point ( 51 , 52 ,' ■■) in that 
space. The rescaling of the cut-off causes the point to travel through this 
multidimensional space along a renormalization group trajectory, 
and this is called a renormalization group flow (see Fig. 34.1). We 
will sometimes find fixed points in this space; these remain invariant 
under coarse graining and correspond to scale-invariant Lagrangians. 
We will see that such Lagrangians are very important in determining 
the physics described by the theory. 


Example 34.2 

A very simple example of renormalization group flow occurs for a theory when there 
is only a single coupling constant g. A graph with only one axis is hard to illustrate 
(much like the problem with the sound of one hand clapping) so to illustrate the flow 
it is conventional to plot the function 


m 


d g 

din b’ 


(34.8) 


which is known as a Gell-Mann—Low equation 4 or simply a flow equation. 5 
Note that: 
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• If (3 is always positive then g will be blasted out to infinity, as shown in 
Fig. 34.2(a). If (3 is always negative then g will be sucked backwards to g = 0 
as shown in Fig. 34.2(b). 

• If at position g = g* we have /3(g*) = 0 then g is stuck and will remain at g* 
for evermore. We call g* a fixed point. 

• Even if a fixed point exists the system may never get stuck there. If the 
velocity function (3 looks like Fig. 34.2(c) then if g > g* the system will end 
up at oo, while g < g* will be drawn back to g = 0: here g* is known as a 
repulsive fixed point. 

• If the function (3{g ) looks like Fig. 34.2(d) then the system will be attracted 
towards g* regardless of its initial condition. There are no prizes for guessing 
that under these circumstances g* is known as an attractive fixed point. 


34.3 The renormalization group method 

Now that we know what it’s useful for, we’d like to know how the renor¬ 
malization group method works in more detail. We start with the func¬ 
tional integral, written in Euclidean space 

Z{ A) = [ V<j)e~^ ddxC[4,] , (34.9) 

J A 

where A is an instruction to integrate over configurations of <j)(x) con¬ 
taining Fourier components up to A. 

We will use the renormalization group method to examine the physics 
as we increase the length scale of interest. The method involves three 
steps. In step I we’re going to remove the largest Fourier components of 
momentum. This is achieved by doing that part of the functional integral 
that involves the largest Fourier components in momentum space. The 
removal of these components is equivalent to losing one’s glasses. A 
certain level of fine detail is lost and we have a simpler looking held 
(Fig. 34.3). We then compare the theory to what we started with. Since 
we now seem to be comparing apples (the original integral) and oranges 
(the partly integrated integral) we need to scale up the partly integrated 
integral so that it is defined over the same momentum range as the 
original. To do this involves (step II) relabelling the momenta and 
then we (step III) scale the size of the fields so that the Lagrangian 
resembles what we had before. The beauty of the procedure is that, in 
successful cases, it will not change the form of the Lagrangian, only the 
coupling constants. We then imagine repeating the procedure over and 
over again (losing successive pairs of spare glasses) and extract how the 
couplings change as we partially integrate the functional integral. 

We now demonstrate the procedure, which is illustrated in Fig. 34.4. 
First we need to set the scene. We choose to work in momentum 
space, where we divide the fields 4 >{p) into those parts that vary slowly 
with p [called (j> s {p)} and those parts [called (j>f(p)] that vary quickly 
(see Fig. 34.5). We define the fast Fourier components as those in the 
momentum shell A/6 < |p| < A, where 6 > 1 is a scaling factor, which 



Fig. 34.3 Removing the largest Fourier 
components results in a loss of detail. 



Fig. 34.4 The renormalization group 
process in momentum space, (a) The 
field <f)(x ) before the process, (b) The 
result of integrating out the largest mo¬ 
mentum states (step I) is to remove the 
high spatial frequency components, (c) 
The field after rescaling (steps II and 
III). Note that the region in the dashed 
box is identical to the entire trace in 
(b). 
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Fig. 34.5 Components in momentum 
space are divided up into slowly varying 
and fast varying, with any component 
with |p| > A being ignored. 
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6 We therefore have 

4>(p) = <j>{ (p) for A/6 < p < A, 
<j>{p) = <f> s (p) for 0 < p < A/6. 


will be central to our analysis. The slow Fourier components of the 
fields are then those with momenta 6 0 < p < A/b. The functional 
integral becomes 

Z(A) = [ V<j> s e-S AdxC ^ [ V<j> t e-S AdxCl M°’ (l,t % (34.10) 

JA/b JA/b 


We’re now ready to start the renormalization group procedure. 

Step I: This is the difficult step. We need to integrate over the quickly 
varying fields. This is usually impossible, so we need an approximation 
scheme involving perturbation theory. Assuming we can do this, we’ll 
obtain an action in which the remaining slow fields and coupling con¬ 
stants are different to what they were before. If we can (somehow) do 
the integral over the fast varying part <f >f we define the answer to be 



/ d d a:,Ci[0 s ,0f] 


— e ~ f d d xSC[4> e ] 


(34.11) 


leading to the result for our integral: 


Z{A)= f Vcj) s e~ f d<ix ( £ W , »]+ i£ W , »]), (34.12) 

JA/b 



A 


Fig. 34.6 Rescaling the momentum. 

•7 

The subject is notorious for its arcane 
terminology, and d^ is known as the 

anomalous dimension. 


^ Sending i to infinity corresponds to 
looking at the infrared behaviour of the 
couplings (since large length is the same 
as small energy). We could send i —> 0, 
if we want the ultraviolet behaviour. 


which only features the slow fields. 

Step II: We relabel the momenta. We do this by expressing our 
theory in terms of scaled momenta p' = pb. Since b > 1 this has the 
effect of stretching the momentum space out over the range we had 
originally, as shown in Fig. 34.6. This is seen by examining the limit of 
the integral, which is p'/b = A/b or p' = A. 

Step III: Finally we scale the fields. We select the term that we 
imagine will be most important in determining the scaling behaviour 
and require it to be invariant with the scaling of step II. We use this 
to rescale the field <j>(p) = /b) = b d ~ d,t, (j)'(p'), where d$ is chosen to 

leave the important term invariant. 7 

Now that the procedure has been carried out once, we repeat it an 
infinite number of times. In fact, we only need to carry out the procedure 
once and then infer the consequence of repeating it. That is, we do the 
approximate integral which tells us how all of the coupling constants 
change with scale. We then imagine a continuous flow of the coupling 
constant caused by repeating the renormalization group method again 
and again. 

By removing the fastest varying fields we are looking at the physics 
at lower momentum, or equivalently at longer length scale. We can 
therefore ask how the physics changes as we send the length scale of 
interest £ to infinity. One way to do this is to set b = e f and search for 
flow equations of the form 

= Pi(gi,92,-,gN)- (34.13) 

This is another form of the Gell-Mann Low equation. 8 
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The renormalization group is most famously applied to phase transi¬ 
tions, an example we will defer until the next chapter. Here we focus 
on three examples which illustrate other features of the renormalization 
group. 


34.4 Application 1: asymptotic freedom 

Perturbation theory is carried out in terms of series expansions in cou¬ 
pling constants. The renormalization group teaches us that coupling 
constants change depending on the region in energy-momentum space 
in which we’re working. If we work in a region of momentum space 
where the coupling constants are small, then perturbation theory will 
work well. If, however, we work in a region where the coupling con¬ 
stants are large then we can expect perturbation theory to fail to yield 
sensible results. The coupling ‘constant’ in quantum electrodynamics 
is the electronic charge. As we’ll see in Chapter 41, to first order, the 
P function describing the evolution of e is written 9 = |e| 3 /127r 2 , 
where g is the energy scale of interest. The important thing to note 
is that the right-hand side is positive. This means that as we flow to 
large energy-momentum transfer (by increasing the mass-energy scale 
/i), the coupling constant gets larger and larger. Eventually, when the 
momentum involved in the processes we’re calculating implies a very 
large mass-energy scale, perturbation theory ceases to be sufficient to 
capture the physics. Of course, running this process backwards (cor¬ 
responding to examining processes involving small momentum transfer 
and large length scale) the coupling gets smaller and smaller and per¬ 
turbation theory captures the physics better and better. This confirms 
the expected result that perturbation theory works best for processes 
involving small momenta and energy and gets worse as the energy gets 
larger. It also confirms the physical picture that electric charge is largest 
close to a charged source (that is at small length scales), and dies away 
at large distances. 10 

A surprising result was discovered for a class of theories, which in¬ 
cludes the famous non-abelian Yang-Mills theory that will be examined 
in Chapter 46. In this case, f3 is negative! Therefore, if we flow to large 
momentum transfer, the coupling constants become small and perturba¬ 
tion theory at low orders is a very good approximation. However, now 
if we reverse the process, allowing the momentum transfer to become 
small, then g gets very large and perturbation theory no longer gives 
good results. Theories of this sort are said to have asymptotic free¬ 
dom : 11 they are most strongly interacting at low energy scales, but at 
large scales they are weakly interacting (and therefore act like nearly 
free theories). 12 

The importance of asymptotic freedom is that it allows us to explain 
(among other things) the physics of the strong interaction. Under the 
normal (i.e. low energy) circumstances of much of the Universe, the 
quarks in a proton are very tightly bound. However, it was found exper- 


9 Notice again that slightly different 
definitions of the flow are used depend¬ 
ing on context. This is nothing to 
worry about since the idea is always the 
same: we want to know how the cou¬ 
pling changes with scale. 

10 This occurs because of screening 
by virtual charged particle-antiparticle 
pairs. See Section 41.1 and Fig. 41.6. 

11 Frank Wilczek now regrets coining 
this term, which was first suggested 
by Sidney Coleman, and wishes he 
had used ‘charge without charge’ in¬ 
stead. The idea is that closer and closer 
to a quark the effective colour charge 
‘asymptotically’ approaches zero. Hav¬ 
ing zero colour charge means complete 
‘freedom’ from its interaction. Playing 
the argument in reverse, at large dis¬ 
tances away from the intrinsically weak 
colour charge of the quark, the cloud 
of virtual particles give rise to anti¬ 
screening and this greatly enhances 
the effective colour charge and results 
in the strong interaction being strong. 


12 

One might think of masses connected 
by springs as showing a sort of asymp¬ 
totic freedom. At small length scales 
they interact weakly; as the length scale 
is increased and the springs become 
stretched they interact more strongly. 
This is why you cannot isolate quarks 
and gluons as free particles, but only 
find them in bound states inside mesons 
(quark, antiquark) or baryons (three 
quarks). 
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13 

The virtual gluon excitations in the 
vacuum carry colour charge and serve 
to reinforce the field, rather than screen 
it. This effect is sometimes called anti¬ 
screening (see above) but is essentially 
another way of describing asymptotic 
freedom. 


imentally in deep inelastic scattering experiments that quarks impacted 
by very highly energetic electrons behave as if they’re free (or, at least 
that the coupling between quarks is very small). This is entirely the 
circumstance described by theories that show asymptotic freedom. The 
strong interaction is described by a theory known as quantum chromo- 
dynamics, which is a non-abelian gauge theory with an internal SU (3) 
symmetry which gives rise to a conserved charge known as colour. 13 
The theory shows asymptotic freedom. We introduce non-abelian gauge 
theories in Chapter 46. 

34.5 Application 2: Anderson localization 


14 See Exercise 34.1. 


David Thouless (1934- ) 

15 

This is another example of how the 
(3 function is defined in various ways to 
suit the problem in hand. 


p 



Fig. 34.7 The renormalization group 
flows for Anderson localization. Arrows 
show the flow for long length scale. 


Crystalline metals conduct electricity, but what happens when you in¬ 
troduce more and more disorder? Philip Anderson realized in 1958 that 
beyond a critical amount of disorder, the diffusive motion that results 
from impurity scattering would completely stop and the electrons would 
become localized, residing in bound states rather than in the delocalized 
band states of the pure metal. This process is called Anderson local¬ 
ization. Thus we can imagine making a metal more and more impure 
by implanting specks of dirt in it and at some point we expect to cross 
from metallic behaviour to insulating behaviour, as Anderson localiza¬ 
tion sets in. This tipping point in behaviour is known as the mobility 
edge. 

Let’s think about the scaling properties of conductivity, since that will 
be important in a renormalization group treatment. A three-dimensional 
metal of volume L 3 conducts electrons. It has a small resistance R and 
a large conductance G = 1 /R. How does the conductance vary with 
the length of a piece of metal? As examined in the exercises 14 G{L) = 
aL, where a is the conductivity. In fact, for a d-dimensional solid, 
G(L) oc. L d ~ 2 . In contrast, an insulator has a small conductance, given 
by G{L) ex e ~ L A where £ is a length scale determined by microscopic 
considerations. What we really want to know is that if we start with 
a material with some fixed amount of dirt in it, do we expect it to be 
a conductor or an insulator? The renormalization group provides the 
answer! For this problem, David Thouless showed that the coupling 
constant of interest turns out to be the conductance G. We define our 
f3 function as 15 


/%) 


ding 

dlnL 


L dg 
gdL’ 


(34.14) 


where the dimensionless conductance is defined as g = HG{L)/e 2 . Dif¬ 
ferentiating our expressions, we obtain a prediction for the scaling in the 
metallic and insulating regions for a d-dimensional solid: 


J (d — 2) metallic (large g) 

1 In g insulating (small g). 


(34.15) 


Patching these limits together we get the flow shown in Fig. 34.7. Notice 
the fixed point at g c , which will turn out to represent the mobility edge. 
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The flow diagram predicts the behaviour of the system as we look at 
larger and larger length scales. This is useful as the limit of large length 
scales is what our measurements of electrical resistance probe. 

A material containing a particular amount of dirt will be an insulator 
or metal, depending on whether the flows takes it to the left (small g) 
or right (larger g) respectively. 16 We see that, for d = 3, we have the 
behaviour we predicted in our naive picture above: if the material has a 
conductance above that at the mobility edge (the fixed point in the flow 
known as g c ) the material is destined to be a metal. If it has so much 
dirt that g < g c it will be an insulator. The mobility edge is represented 
by a repulsive fixed point. The surprise comes when we look in d = 2 
and d = 1 dimensions. In these cases there is no fixed point and hence, 
no mobility edge! If the system has any dirt in it at all it must flow to 
g = 0 and will be an insulator. This is a profound statement! 


16 Note that a flow implying g —> o c 
doesn’t mean that we will measure this 
behaviour in a real system. We would 
expect some other piece of physics, not 
included in our Lagrangian will cut off 
such flows eventually. 


34.6 Application 3: the 

Kosterlitz-Thouless transition 


For any theory of a magnet in two spatial dimensions with a continu¬ 
ous symmetry, the Coleman-Mermin-Wagner theorem (of Chapter 25) 
predicts that there can be no symmetry breaking phase transition to 
an ordered magnetic state. However, a phase transition of a different 
sort is predicted by a remarkable renormalization group analysis. This 
is a topological phase transition, which separates two phases of matter 
containing rather different topological objects. 

We work in two spatial dimensions, and examine vortices, which are 
the topological objects that can exist in (2+l)-dimensional spacetime. 
As discussed in Chapter 29, lone vortices are unstable 17 at T = 0. An 
example of such a vortex is shown in Fig. 34.8. However, it’s a different 
story at high temperatures. This is because, rather than minimizing the 
energy to find the thermodynamic equilibrium state of the system, we 
must minimize the free energy F = U — TS, and so at high temperature 
there is a need to maximize entropy. The presence of a vortex gives the 
system some entropy, since its centre may be placed at any one of a large 
number of lattice sites and this provides a way of minimizing the free 
energy. The entropic term in F is weighted by the temperature, making 
the entropic advantage of vortices dominant at high T. We are led to 
the conclusion that thermodynamics favours the proliferation of vortices 
at high temperature. 

The question is, what happens at low temperatures? This is the ques¬ 
tion that Kosterlitz, Thouless and Berezinskii addressed. Typically of 
condensed matter applications of the renormalization group, we ask what 
happens to the system of vortices as we look at successively longer length 
scales. The renormalization group analysis of the two-dimensional vor¬ 
tex problem therefore involves asking what happens to the important 
coupling constants as we send the length scale of interest to infinity. 


J. Michael Kosterlitz. The transi¬ 
tion was simultaneously discovered by 
Vadim L. Berezinskii in the Soviet 
Union. 


17 

To recap: this is because the spin 
field is swirly at spatial infinity and this 
costs the system an infinite amount of 
energy for a single vortex. 
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Fig. 34.8 A vortex has infinite energy, 
due to the swirliness at spatial infinity. 
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1 See also Chapters 42 and 44 for more 
on the XY model. 

1 The reason for writing the action this 
way is to prevent the integral from di¬ 
verging at the centre of the vortex, 
where the field looks to wind an infi¬ 
nite amount. 


20 See Exercise 34.2. 


21 

More generally J arises from the 
rigidity of the system. Rigidity is a 
general feature of ordered systems, re¬ 
flecting the energy cost associated with 
deforming the order. 


y 

A 



Fig. 34.9 Flow diagram for the 
Kosterlitz-Thouless transition. The 
horizontal axis is K _1 = T/J. 

22 See Altland and Simons, Section 8.6 
for details of the calculation. 


Example 34.3 

We model this system with a complex scalar field given in polar coordinates <f)(x ) = 
e 1 ^*) . At a point in space x we see the field as an arrow of unit length pointing along 
a direction given by an angle 0(x). Such a picture is known as the two-dimensional 
XY model 18 since the arrows are constrained to the two-dimensional plane, where 
their direction may vary continuously. The Euclidean action of a gas of vortices is 
given by 19 

s = S cole (a) + — [ d 2 x\V(t>(x)\ 2 , (34.16) 

2 J a 

where the first term represents the energy of the core region of the vortex, which 
has size a and the second term represents the energy cost of having a spin field 
that varies with position outside the core in a system with sides of length L. The 
constant K = J/T where J is known as the stiffness of the spin system and T is 
the temperature. We may plug in our ansatz for the field at a vortex, given by 
<f>(x) = e 1 ®^), where d(x) + ip = tan - 1 (x 2 /x 1 ) for \x\ —> oo, just as we had in 
Chapter 29. We integrate to find an action 20 

S = S cole (a) + wK ln(L/o). (34.17) 


What are the coupling constants that we change as the system flows to 
larger length scales? 

• The spin stiffness J is one of the coupling constants and tells us 
how much energy cost we pay for having the spins describe some 
non-uniform configuration. This is the parameter that causes a 
single vortex to cost infinite energy. 21 At high temperature, where 
spins are presumably disordered, we do not expect any contribu¬ 
tion of the spin stiffness to the action determining the physics of 
the system over long distances. At low temperature J will be de¬ 
cisive in determining the configuration of spins. We thus need to 
consider the ratio K = J/T which reflects the balance between 
J and T and therefore we use K (actually I\~ l ) as one of our 
variables. 

• The other coupling reflects the energy cost of a vortex core divided 
by the temperature. It is known as the fugacity and is given by y = 
e S core ( a ) ( no te that the Euclidean action is effectively a ratio of 
energy and temperature in statistical mechanics, see Chapter 25). 
The fugacity effectively tells us how the system feels the presence of 
the vortices. A small fugacity tells us that we have a configuration 
with small, energetic and compact vortex cores [with large S' core (a)] 
and that the spins don’t swirl much at infinity; a large fugacity tells 
us that we have less action in the cores and a significant energy 
cost from swirling fields at large distances owing to the existence 
of free vortices. The energy cost here is, as usual for statistical 
mechanics problems, measured with respect to the energy scale 
set by fepT. 

The behaviour of these two couplings predicted by the renormaliza¬ 
tion group 22 is shown in Fig. 34.9. Following the arrows (which show 
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the flow to large length scales) we see that most of the trajectories point 
to K ~ l , y —> oo. This is the expected behaviour in the high-temperature 
phase. The fact that y —> oo tells us that we have free vortices present 
making a non-uniform texture of spins at infinity. In this phase entropy 
wins out and J is ultimately an unimportant consideration in determin¬ 
ing the thermodynamics, and so AT = J/T flows to zero. 

The high-temperature phase, however, is not the only fate of the sys¬ 
tem. We see that in the bottom-left of the plot there are flows that lead 
to fixed points on the A" -1 axis, with y = 0. This is the low-temperature 
phase of the system, where stable configurations of vortices are possible, 
even at T = 0. The fact that y flows to zero means that the vortex con¬ 
figurations must be quite compact, so that the cores look less and less 
significant at larger and larger length scales and that the spin texture 
at infinity must be uniform. We also have that K ^ 0, meaning that 
the system has rigidity. Since rigidity is one of the inevitable side-effects 
of magnetic order, this tells us that the low-temperature phase involves 
some (possibly exotic) type of ordering. 

How can this be? The key to understanding the low-temperature 
phase is the fact that it involves bound states of vortices and antivortices. 
As shown in Fig. 34.10, an antivortex has spins that wind in the opposite 
direction to a vortex. A picture of the low-temperature phase is shown 
in Fig. 34.11 where we see that bound pairs of vortices and antivortices 
are the stable field configurations. The stability of these pairs at low 
temperature is evident if we look at the fields at a large distance from 
a vortex-antivortex pair as shown in Fig. 34.12. The clockwise swirl 
of the vortex at infinity is cancelled by the anticlockwise swirl of the 
antivortex, making y —> 0. This means that in a system with nonzero 
J, the pair costs a finite amount of energy. 

We can even come up with an estimate of the transition temperature 
between the low and high temperature phases. Consider the stability 
of the high-temperature phase, containing a single vortex. The energy 
of the vortex is given (from eqn 34.17) by 7rJln(A/a). The size of the 
entire system is L and the size of a vortex core is a. We may therefore 
fit a vortex into the system in L 2 /a 2 ways, giving an entropy fee In Q = 
/cBln(L/a) 2 . The free energy is then F = ( 7 r J — 2k^T) ln(A/a). The 
only way of preventing this free energy from becoming very large and 
positive is for 2fce T > ttJ. Below T ns 7rJ/2fcB the system is unstable 
to the formation of bound vortex-antivortex pairs. Of course, this is 
identical to the fixed point in the renormalization-group flow diagram 
of Fig. 34.9 at A" -1 = | (apart, of course, from the factor ks that was 
set to one in our hair-shirt quantum field treatment). The transition is 
often also called the vortex unbinding transition, describing the breakup 
of the bound states on warming through the transition region. 
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Fig. 34.10 
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vortex, (b) An an- 


tivortex. 



Fig. 34.11 Schematic of the low- 
temperature phase, viewed at long dis¬ 
tance. The +s represent vortices, the 
—s anti vortices. They form a gas of 
dipole-like pairs. 
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Fig. 34.12 A vortex dipole made from 
a vortex and antivortex in the same re¬ 
gion of space. Notice that the fields 
may now be made uniform at infinity. 
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Chapter summary 

• The renormalization group is a method for studying how coupling 
constants change with rescaling. 

• This method provides insight into many physical phenomena, 
including asymptotic freedom, Anderson localization, and the 
Kosterlitz-Thouless transition. 


Exercises 

(34.1) We will show using dimensional analysis that the 
conductance of a d-dimensional solid varies as 
G(L) oc L (d - 2) . 

(a) For d = 3 consider a cubic sample with sides of 
length L and show G = aL, where a is the conduc¬ 
tivity. 

(b) For d = 2 consider a flat plate of thickness a 
and other sides of length L and show G = aa. 

(c) For d = 1 show G = aa/L. 


(34.2) By considering the ansatz in the text verify that the 
action of a vortex is given by 

S = S cole (a) + -kK In(L/o). (34.18) 

If instead the phase winds n times round a vortex, 
show that 


S = S cole (a) + 7r Kn 2 In (L/a). 


(34.19) 
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In this chapter we are going to study how the coupling constants in a 
Euclidean theory vary with increasing length scale. This will tell us 
about the physics of phase transitions. To make things concrete, we will 
discuss the magnetic transition separating a paramagnetic phase and a 
ferromagnetic phase. 1 Perhaps the most wonderful feature of a phase 
transition is universality. This is the notion that some of the key 
properties of a phase transition depend only on the dimensionality of a 
system and dimensionality of its order parameter field. The details of 
the microscopic interactions between the constituent parts, whether the 
transition is ferromagnetic or superconducting, or whether it involves 
polymers or the early Universe, are all irrelevant. Thus focussing on the 
ferromagnetic transition will nevertheless give results that are applicable 
to any system described by the same Lagrangian. 


35.1 Background: critical phe¬ 

nomena and scaling 313 

35.2 The ferromagnetic transition 
and critical phenomena 315 

Chapter summary 320 

Exercises 320 

^We have already discussed the mean- 
field treatment of this in Chapter 26. 
In this chapter we will include the pos¬ 
sibility of fluctuations in the fields. 


35.1 Background: critical phenomena and 
scaling 


Magnetic phase transitions may be observed with several experimen¬ 
tal techniques. Some thermodynamic properties of a magnet in which 
we might be interested include the magnetization M (which is the or¬ 
der parameter of the ferromagnet system); the magnetic susceptibility 
X = lima^o , which tells us how much magnetization we can achieve 
through applying a field B\ and the heat capacity C, which tells us how 
the energy of the system varies with temperature. To these thermody¬ 
namic properties we can add the correlation length £, which tells us 
how well correlated the fluctuations are in space (Fig. 35.1). 

The correlation length is related to the correlation functions, which 
are the Green’s functions of Euclidean space. As described in Chap¬ 
ter 25, a slight modification of the usual correlation function G(x, y) = 
{4>{x)(j)(y))t is useful when studying phase transitions. This is the con¬ 
nected correlation function Gdx^y), given by 


(35.1) 
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Fig. 35.1 The correlation length in 
a paramagnet showing a ferromagnetic 
fluctuation of size £. 


G c {x,y) = {Hx)Uy))t - (Ux))t(<i>(y))t, 
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2 Closer to the disturbance the correla¬ 
tions are found to decay more slowly, as 
described below. 


3 

Unlike, for example, exponential func¬ 
tions such as e — ®/£, power laws are free 
from any dependence on a length scale. 
This scale-free property is a hallmark of 
critical behaviour, where the character¬ 
istic fluctuations occur over all length 
scales. 


4 In these calculations, one studies the 
reduced free energy f (dividing F 
by both volume V = L d and temper¬ 
ature T) so that / = F/(L d T). We 
also define reduced temperature t = 
\T — T c \/T c and reduced field h = B/T. 


and differs from G(x , y ) by the subtraction of a product of thermally 
averaged fields {<j>(x))t{<j>(y)}t- At temperatures above the transition 
temperature T c we have {(/>(x ))t = 0 and the definition is identical to 
the usual form of the correlation function. At temperatures below T c , 
subtracting off the mean-fields allows us to just see the fluctuations away 
from average behaviour. For example, in a magnet above T c , despite 
the spins being disordered they may save energy by pointing in the 
same direction, and will tend to do so within the correlation length £. 
The correlation between spins at position x and position y decreases 
with increasing \x — y\ because thermal fluctuations have a tendency to 
randomize the spins. Thus we find that 2 

G c (x,y)~e^~y '/«, (35.2) 

for \x — y | £. The subtraction of the average fields in our definition 

of G c means that this is also the behaviour below T c and represents the 
fluctuations away from perfect order. 

At temperatures close to the ordering temperature T c it is found exper¬ 
imentally that the physical properties of a magnet described above fol¬ 
low power law behaviour 3 as a function of temperature, applied magnetic 
Held or distance. This motivates the definition of critical exponents 
to describe the phase transition. Defining the reduced temperature as 
t = (T — T c )/T c , the critical exponents a, /3, 7 , <5, u, £ and rj are related 
to physical properties as follows: 

• Heat capacity: C ~ 

• Magnetization: M ~ for B —> 0, T < T c , 

• Magnetic susceptibility: \ ~ |f| -7 , 

• Field dependence of x at T = T c : \ ~ \B\ 1 ^, 

• Correlation length: £ ~ \t \~ v , 

• The correlation function G(r) behaves like 

CM ~ | r | <<: ( (35.3) 

[e « M>£, 

where r is distance and d is the dimensionality of the system. 

A renormalization group analysis of the ferromagnet relies on varying 
the length scale at which we examine the physics. Well before the as¬ 
cent of the renormalization group, Benjamin Widorn suggested that the 
critical behaviour that is observed experimentally may be explained if 
the free energy 4 of the system has a particular form under such a scaling 
of the length scale. The idea is that when rescaling lengths by a factor 
b (so that a length L —> bL ), we should expect 

f(t,h)=b~ d f(t',ti), (35.4) 

where the factor b~ d is because of the L~ d factor in /. Widom assumed 
that t' = b Vt t and b! = b Vh h , where y/i and y t are exponents yet to be 
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determined. Thus the Widom hypothesis is 


/(t, h) = b~ d f(b Vt t, b Vh h). (35.5) 

We may use the fact that b is an arbitrary parameter and choose a value 
of b such that b Vt t = 1. This reduces the number of free parameters by 
one, and we obtain 

f(t,h) = t d ^f(l,h/ty^), (35.6) 


which is Widom’s hypothesis, written in reduced variables. Using this 
result, and also expressions relating the free energy to physical properties 
such as magnetization, heat capacity, magnetic susceptibility, etc., one 
can show that the critical exponents can be expressed in terms of the 
y 's as follows: 5 


a = 2 — —, ( 3 =^ 

Vt ’ r Vt 

Vh 


6 = 


~ — 2 Vh~d 
' Vt ’ 


d-Vh ’ 


v =h> ii = 2 + d-2y h . 


Our task is now to use the renormalization group to find y t and yh- 
We find these quantities by noting how the quantities t and h flow on 
renormalization. Since we have the predictions t —> b Vt t and h —» b Vh h , 
we should be able to simply read off the values of yt and yh . If we can 
do this then we have access to all of the critical exponents, which we 
can compare with experiment. 


35.2 The ferromagnetic transition and 
critical phenomena 


Let’s feed a theory of the ferromagnet through the renormalization group 
machine and extract some physics. A continuum field theory of the 
ferromagnet is given by the Landau—Ginzburg model. This model 
is equivalent to (time-independent) d-dimensional Euclidean </> 4 theory, 
which has an action 6 


S'e = 






(35.7) 


Example 35.1 

We may see what the Landau-Ginzburg model predicts in mean-field theory where 
we take 0 to be uniform across the whole system . 7 We then have a free energy 

F =—$ 2 + -< t > 4 , ( 35 . 8 ) 

2 4! ' 

where <E> = f d d x If A = 0 then the minimum free energy occurs at M = 0 and 
we have a paramagnet. If A > 0 and m 2 = a(T — T c ) < 0 then we have minima 
at M = [6a(T c — T)/A] 2 ^ 0, giving a ferromagnet. However, mean-field theory 
is blind to fluctuations. We will show with the more sophisticated approach of the 
renormalization group that even if m 2 < 0 and A > 0 a phase transition is not 
guaranteed due to the influence of these wobbles in the field. 


^See Exercise 35.1. 


Vitaly Ginzburg (1916-2009) 

^ We can choose d to suit whatever di¬ 
mensionality of system we need. Or¬ 
dinarily we imagine materials having 
d = 3, but it is possible in condensed 
matter physics to find materials which 
effectively have d = 2 and d = 1. 


j 

Note that this approach recreates the 
Landau theory of phase transitions 
used in Chapter 26. 
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8 In this context, the A = 0 free theory As a warm-up exercise we’ll set the coupling A = 0 and carry out the 
is called the Gaussian model. renormalization group procedure for the free theory. 8 


Example 35.2 

We choose to work in momentum space, so we write the action in terms of fields in 
momentum space in the usual manner: 

S E = ^ J+ m 2 )4>(p). (35.9) 

To set the scene, we split the fields up into slow and fast via (ft = (ft f + where 


4>(p) = (p) for A/6 < p < A, 
4>(p) = 4> s (p) for 0 < p < A/6. 


(35.10) 


The action in eqn 35.9 is diagonal in p, which means that slow and fast fields will 
not be mixed up in the calculation of S. Let’s go through the steps with this free 
theory. 

Step I involves integrating over fast momenta. This just results in a constant con¬ 
tribution to the action, which we are free to ignore. We therefore have the integral 


Z=[ D4, s e- S ^=[ V4, se - hS ^ Up){p2+m ^ {p) 

JA/b J A/ 


/ A/b JA/b 

Step II: We scale the momenta with p'/6 = p to obtain an action 
6 


Se = 


k —d /*A ^dp/ 


[ FTu M-J/b) ( b~ 2 P ,2 + m 2 ) Mp'/b). 

Jo (2n) a 


Step III: We scale the fields via 4> s (p' /6) = b d d<t> <ft'(p') to obtain 

Se = 


b 2 (d - d , )b - d ,A d y , 2 , , 


[ /^4>’(- P ')(b- 2 p' 2 +m 2 )4>'(p'). 

Jo {Z7T) a 


(35.11) 


(35.12) 


(35.13) 


Fixing the constant d^ so that the gradient term (f>(—p)p 2 4>(p) is invariant, we have 
d — 2d ( j ) — 2 = 0 or d^ = (d — 2)/2 and 



d d p' 
(2n) d 


(A 2 


m 2 6 2 ) 4>'(p')- 


(35.14) 


The result of our renormalization procedure is that after scaling, we find that we 
have a mass term m' 2 = m 2 b 2 . That is, since 6 > 1, the coupling constant m 2 gets 
larger with each rescaling. 

Since 6 > 1, the fact that m' 2 = m 2 b 2 means that m 2 is a relevant variable 
and (ft 2 is known as a relevant operator. This means that it gets larger with each 
renormalization. (In contrast a variable that becomes smaller at each rescaling is 
known as an irrelevant variable.) Identifying m 2 from our action with the reduced 
temperature t, we expect that each rescaling changes m 2 by a factor b yt . From this 
analysis we can conclude that yt = 2 when A = 0. 

In order to find y ^ we need to consider the application of an external magnetic 
field. Still holding A = 0 we write an action that includes the influence of the field as 


Se = 





(35.15) 


On rescaling the field-dependent term will become —hb d d< t>(ft , (p'). Substituting for 

d +2 

d,p as before we have h = hb 2 5 and so yh = (d + 2)/2. This allows us to write the 
exponents predicted from this A = 0 theory. 



35.2 The ferromagnetic transition and critical phenomena 317 


Now let’s turn on the interaction. Since we don’t know how to integrate 
the functional integral for the <j > 4 theory it should come as little surprise 
that neither can we do the integral over the fast modes for </> 4 theory. We 
therefore have to treat £i = as a perturbation and do the integral 
approximately. Our previous method for working out approximations to 
the generating functional Z[J] using Feynman diagrams may be adapted 
to get an approximate answer for the result of removing the fast modes. 
The only difference in calculating the Feynman diagrams for the renor¬ 
malization group analysis is that we integrate over internal momenta 
only over the fast modes, up to the cut-off. 

In order to work out the lowest order corrections to the couplings we 
consider diagrammatic corrections to the theory in order of how many 
loops they contain. The lowest order diagrammatic corrections to the 
theory therefore involve diagrams with a single loop that correct the 
self-energy (which determines m 2 ) and the phion- -phion vertex function 
(which determines A). These one-loop diagrams are shown in Fig. 35.2. 


Example 35.3 

We give the results of the important computations here. 

The first diagram in Fig. 35.2 corresponds to an amplitude — ^f p 2+ m 2 » 
which yields a contribution SS^ to the quadratic part of the action given 9 by 


ss^ = 


L 


A jdp 
A/6 (27r) d p 2 - 


/< 


(35.16) 


Doing I: the integral over fast modes, II: rescaling the momentum and III: rescaling 
the fields, we obtain a contribution to the quadratic term 


SS (2) = b 2 


xn d , _ ,2 -d) _ m 2 xn d _ 
2(d — 2)' ; 2(d-4) 1 


/ 


d d x^, 

2 


(35.17) 


which we give in terms of which is the volume of a unit sphere in d dimensions. 
Measured in units of 2n this volume is given by ^r(d/ 2 ) ’ w ^ ere Ffz) = 

/ 0 °° d t e~ t t z ~ 1 is the Gamma function. The usual procedure in these calculations is to 
work in four-dimensional spacetime where the integrals converge and fid —4 = 1 /Sn. 
To see the effects in other dimensions we set d = 4 — e, and expand in the small 
parameter e. To examine the real-life case of three dimensions we then set e = 1, 
well outside the presumed range of applicability of this approach. Amazingly, this 
procedure describes reality very well! 

Turning to the second diagram in Fig. 35.2, it turns out that we only need to 
evaluate the amplitude for zero external momentum , 10 which means both internal 
propagators carry the same momentum. We then obtain a contribution SS^ to the 
quartic term in the action 


ss w = — 
2 


L 


d d q 


1 


= V 


A/6 (27r) d (q 2 + m 2 ) 2 

(1 - fe 4 - d ) J , 


/ 


4 —d 3\~£i d 
2 (d - 4)' 


d d x — 
4! 


4! 


(35.18) 


The result of this expansion is that we must shift the variables according to 


= b 2 


\' = b 4 


A - 


Xfld 
2 (d - 2) 
3X 2 Q d 
2(d — 4) 


(1 _ b 2—d) _ 


(1 - b 4 ~ d ) 


m 2 Xfl d 
2(d — 4) 


(1 - b 4 ~ d ) 


(35.19) 




Fig. 35.2 One-loop Feynman dia¬ 
grams in 0 4 theory. 

^ Although both of our diagrams have 
symmetry factor 2, there is a subtlety 
here in working out the prefactors re¬ 
flecting the fact that there is more than 
one equivalent diagram for each loop. 
As discussed in Binney, Dowrick, Fisher 
and Newman, the correct prefactor for 
a diagram with p loops is • 


1( ^See Altland and Simons, Section 8.4 
for a discussion. 
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P 

/b 



Fig. 35.3 The (3 function for the be¬ 
haviour of the coupling constant A. 


A 



Fig. 35.4 Renormalization group flow 
for a ferromagnet. FM labels ‘ferro- 
magnet’, PM labels ‘paramagnet’. 

11 The predictions of the previous ex¬ 
ample are seen in the flows close to W 2 , 
where we see A decrease or increase de¬ 
pending, respectively, on whether we 
start above (flow B) or below (flow C) 
the value A* = 167T6/3. Flows A and 
D are clearly more complex and rely on 
the magnitude and behaviour of m 2 . 


Now to extract the physics. We’re looking at a system of condensed mat¬ 
ter, so we’re interested in a scaling that makes the length scale longer. 
We therefore set b = e l . Since, as stated in the example, the inte¬ 
grals only converge for d = 4, we set d = 4 — e and expand in the 
parameter e. Evaluating the expressions to leading order in e, using 
^ 4 _ e « ^4 = 1 / 87 T 2 , we obtain the Gell-Mann-Low equations 


dm ' 2 

~dT 

dA' 

~dT 


2 TO 2 


eA — 


A 

16tt 2 

3A 2 


167T 2 


(1 - m 2 ), 


(35.20) 


As shown in Fig. 35.3, these equations have fixed points at (m 2 , A) = 
( 0 , 0 ) and (—|, ^T e )- By expanding the flow around the fixed points 
we can use this information to predict the behaviour of the ferromagnet. 


Example 35.4 

To see the behaviour near the fixed point we employ a first-order expansion to come 
up with matrix equations predicting the flow. A Taylor expansion of f(x) about a 
fixed point x* yields f(x) = f(x*)-\~f'(x*)(x — x*), but since f(x*) = 0, by definition, 
we simply need to evaluate the derivatives at the fixed points. Our linearized flow 
equations take the form 



with the elements evaluated at each of the fixed points. 

For our ferromagnet, the matrices for the two sets of fixed points are 

1 i \ 

3 167T 2 j 



(35.21) 


(35.22) 


Let’s examine how these equations predict the behaviour of A. For the fixed point 
at (0, 0) (corresponding to W\) we have X' = eXl. If e > 0 (that is, we are examining 
the physics in less than four dimensions since d = 4 — e) then A increases as we look 
at larger scales. In that case A is a relevant variable and will be important to the 
physics. Moving on to the other fixed point (—e/6,167r 2 e/3), we see that close to 
A* = 167re/3 we have X' = —e(A — A*)/. In this case A is relevant (i.e. gets larger on 
rescaling) for A < A* but irrelevant (i.e. gets smaller on rescaling) for A > A*. We 
will see the influence of this on the flow of the system next. 


This information, along with an analogous analysis of how m 2 behaves, 
allows us to draw the phase diagram of the ferromagnet, as shown 11 
in Fig. 35.4 for e > 0. The fixed point at W\ is repulsive and that 
at W 2 is mixed: some flows are directed towards it, some away from 
it. Let’s return to the physics of the system: at high temperatures we 
have a disordered, paramagnetic phase; at low temperatures, we suspect 
from our mean-field analysis that there exists an ordered, ferromagnetic 
phase. Relating this to our model, we write m 2 = a(T—T c ), so flows that 
send to 2 to large, positive values predict that the system will end in the 
paramagnetic phase. Conversely, flows that send in 2 to negative values 
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predict that the system becomes ferromagnetic. By examining the flows 
we ask what influence the fluctuations have on the phase transition. 
Recall that our mean-field analysis makes a prediction that any system 
that starts with a negative m 2 will be a ferromagnet and any system 
that starts with a positive m 2 will be paramagnetic. 

The result of our renormalization group analysis is that flows starting 
with positive values of in 2 are always driven towards the paramagnetic 
phase, whatever the value of A, in agreement with the mean-field treat¬ 
ment. However, starting with a negative value of m 2 doesn’t guarantee 
a ferromagnetic ground state! The fact that we have fluctuations, whose 
strength is given by A, may force the system to be a paramagnet. This 
is the case for the flow B in Fig. 35.4, where starting with a significant 
enough A is enough to drag the system towards paramagnetism. On the 
other hand, flows starting with sufficiently small A and sufficiently neg¬ 
ative m 2 (such as paths C and D) eventually flow off to the left, pushing 
the system towards ferromagnetism. 


Example 35.5 

So much for our qualitative analysis of the phase diagram. Let’s extract yt and 
yh and make some quantitative predictions regarding the thermodynamics of the 
ferromagnet. One of the uses of the linearized flow matrices is that they may be 
used to read off the scaling constant yt. Since we have set b = e l , we have that 
each rescaling goes as m' 2 = e lyt m 2 . Close to the fixed point, where our linearized 
equations are valid this implies that Am' 2 = yt(tTi 2 — m* 2 )Al. We can therefore 
read off from the matrix describing the flow around our nonzero fixed point that 
yt = 2 — The one-loop correction doesn’t change y which continues to be given 
by yh = (d + 2)/2 = (6 — e)/2 just as it was from the analysis of the A = 0 case. 
With these two pieces of information, we obtain all six of the critical parameters: 

a=%, 7 = 1 + f i 5 = 3 +e, y=| + ^, 77 = 0 . (35.23) 

These results can be used to make an estimate of the critical exponents of the three- 
dimensional Ising model, as shown in the following table. 12 



a 

P 

7 

<5 

V 

V 

e = 0 (d = 4, mean field) 

0 

1 

2 

1 

3 

1 

2 

0 

6 = 1 0(6) 

0.167 

0.333 

1.167 

4 

0.583 

0 

O 

ii 

0.077 

0.340 

1.244 

4.462 

0.626 

0.019 

3D Ising 

0.110 

0.327 

1.237 

4.789 

0.630 

0.036 

2D Ising (exact) 

0 

1 

8 

7 

4 

15 

1 

1 

4 


The first line of this table gives the mean-field results, exact for d = 4 and so can 
be obtained from eqn 35.23 by setting e = 0. The second line gives a crude, but 
surprisingly accurate, estimate for d = 3 by setting e = 1. One can derive expressions 
for the critical exponents to second order in e [e.g. (3 = § — f + + C(e 3 )] an d 

these yield the estimates in the third line of the table. These approach the currently 
accepted values 13 (fourth line) which agree well with experimental values. In the last 
line we list the exact results for the two-dimensional Ising model for comparison. 


12 

For a discussion of the comparison 
see the books by Kaku, Zinn-Justin 
and Binney et al. See also Peskin and 
Schroeder, Chapter 13. 


13 Taken from A. Pelissetto and E. Vi- 
cari, Phys. Rep. 368, 549 (2002), 
which reviews a large number of theo¬ 
retical studies using various techniques 
and also summarises experimental re¬ 
sults. 
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Chapter summary 


• This chapter has provided a renormalization group analysis of the 
ferromagnetic transition following Widom’s hypothesis for the scal¬ 
ing of the free energy. 


Exercises 


(35.1) (a) Using Widom’s hypothesis in the form f(t, h ) = 

a ~ Vh 

t y tf(h/t y t), show that critical parameters are 
given in terms of the scaling parameters as follows 

« = 2-£, P=^, 7=^- (35.24) (35.2) 

(b) We now redefine the scaling function slightly so 

d Vh 

that it reads f(t,h) = h y s. g(h/t y t ) where g(z) = 
z Vh f( z )■ Use this form of the function to show 
that 5 = , Vh . 

d ~ y h 

(c) From these results prove the following relations: 


a + 2/3 + 7 = 2 Rushbrooke’s law, 
a + p(5 + 1) = 2 Griffith’s law. 


(35.25) 


quire vd, = 2 — a (known as Josephson’s law) and 

use this to show v = —. 

yt 

(e) Finally use Fisher’s law (2 — r/)ty = 7 to find 77 . 

(a) Starting from eqns 35.19, verify the Gell-Mann- 
c.Low equations in eqn 35.20. 

Hint: let b = 1 + p with p small. Then In b ~ p 
and one may expand to order p to capture the flow 
equations. 

(b) Verify the positions of the fixed points. 

(c) Verify the matrices in 35.22 and the behaviour 
of the flow near the fixed points. 


(d) Assuming the correlation function behaves as 
G c (x,t) = f ^ /x d ~ 2+v , argue that we re- 


(35.3) 


What is the behaviour of A for the case of dimen¬ 
sions d > 4? 




Part IX 


Putting a spin on QFT 

This part extends our treatment of quantum field theory to spinors and 
allows us to treat the electron spin. These ideas set us on a path that 
inevitably arrives at quantum electrodynamics. 

• The Dirac equation for an electron is introduced in Chapter 36. 
We describe the 7 -matrices necessary for formulating this and also 
the concepts of chirality and helicity. The basis states are called 
spinors and we describe two types: Dirac spinors and Weyl spinors. 

• We show how spinors transform under rotations and boosts in 
Chapter 37. We also demonstrate that the parity operator turns a 
left-handed spinor into a right-handed spinor and vice versa. 

• In Chapter 38 we apply the machinery of quantization to the Dirac 
equation and derive the Noether current. This allows us to write 
down the fermion propagator. 

• Arguably the most important achievement of quantum field theory 
is quantum electrodynamics (QED), which describes the interac¬ 
tion between electrons and photons, and this theory is presented 
in outline in Chapter 39. 

• Three examples of ideas presented in the previous chapter are 
fleshed out in Chapter 40 via three examples. These are Ruther¬ 
ford scattering, the Mott formula and Compton scattering. We 
briefly discuss the useful feature of crossing symmetry. 

• Some important consequences of QED arise when you consider pro¬ 
cesses involving electron-positron loops. These include the renor¬ 
malization of the electron charge due to vacuum polarization and 
the deviation of the ^-factor from g = 2. These ideas are explored 
in Chapter 41. 
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After seeing 2001: A Space Odyssey , 


Paul Dirac was so impressed that, two 
days later, he attended three consecu¬ 
tive screenings of the film in the cinema. 
His favourite television programme was 
the Cher show. 


^This is the subject examined in the 
two chapters following this one 


The Dirac equation 


I have an equation, do you have one too? 

Paul Dirac (1902-1984), on meeting Richard Feynman 

Our story so far: as a single-particle equation of motion, the Klein 
Gordon equation has some serious problems. The first is negative energy 
states and the second is negative probabilities. Paul Dirac sees that the 
root of these problems lies in the fact that the Klein-Gordon equation 
contains a second-order time derivative as part of d 2 . While staring into 
the fire in the senior common room of St John’s College, Cambridge, 
Dirac realized that there’s a way to construct an equation of motion 
that’s first order in the time derivative. 

In this chapter we examine Dirac’s equation. We will treat it as an 
equation of motion for single particles and examine the form of the 
solutions and how features of these solutions describe the physical prop¬ 
erties of real particles. Although our experience with the Klein-Gordon 
equation has taught us that, ultimately, a single-particle treatment of 
relativistic quantum mechanics will not provide an adequate description 
of Nature, the experience gained with the solutions of Dirac’s equation 
will carry over to the world of Dirac fields 1 whose excitations are the 
fermions we observe in our Universe. 


36.1 The Dirac equation 

In order to avoid all of the problems with the Klein-Gordon equation, 
you might wonder whether we could take the square root of the guts 
of that equation ( d 2 + m 2 ) and then apply it to a wave function. A 
naive guess might be that if we could treat things like d 2 like ordinary 
algebraic symbols then we could write 

(d 2 + m 2 ) = (Vd 2 + im)(Vd 2 — \m). (36.1) 

Unfortunately, Vd 2 isn’t defined. How do we take the square root of the 
d 2 operator? Dirac found the answer was to define a new four-vector 
7 m , whose components have the properties 

( 7 °) 2 = 1, (7 1 ) 2 = -I (7 2 ) 2 = -I (7 3 ) 2 = -1- (36.2) 

Furthermore, these components aren’t ordinary numbers; they anticom¬ 
mute. That is to say that for /i ^ v they have the intriguing property 
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y^y" + 7 ^ 7 ^ = { 7 ^, 7 "} = 0. Taken with eqn 36.2, the properties of 
these 7 ’s may be summarized in the compact form 2 

{ 7 ^, 7 "} = 2 <T. (36.3) 

We use the gammas to cope with the square root in V& 2 , via a strange 
looking new symbol jf = 7 which is pronounced ‘a-slash’. We can 
then say that 

^ 7 ^=( 7 oA +7 >A + 7a | + 7 3jt) 2 . (JM) 

Expanding this bracket and using the all-important anticommutation 
relations we find that 0 2 = d 2 . Substituting 0 2 for d 2 in the Klein 
Gordon equation does indeed allow us to factorize 

(i d 2 + to 2 ) = ( 0 2 + to 2 ) = (0 — i m){0 + i?n). (36.5) 

Now let’s just take the plus-sign bracket and treat it as an operator 
acting on the wave function ip{ x )> 

{0 + i m)ip(x) = 0 , (36.6) 

and this may be tidied up to give us the famous Dirac equation: 

(iy^d^ — m)ip(x) = 0. (36.7) 

We note first that the Dirac equation may also be written as — = 

0 , where f = y^ic^) and we recognize the energy-momentum operator 3 
Pp 

We’ll soon see that the dispersion relation of the particles described 
by the Dirac equation of motion continues to be E 2 = (p 2 +m 2 ), which 
still admits negative energy states . 4 

To get to Dirac’s equation, we have had to introduce the mysterious, 
anticommuting 7 ^ objects. These will have a profound influence on the 
physics described by the Dirac equation because the simplest way to 
satisfy the commutation relations is to represent the y’s as four-by-four 
matrices. This means that the wave function ip( x ) described by the 
Dirac equation is a special four-component wave function. Why four 
components? What could they represent? To make progress, we’ll use 
an explicit form for the matrices and examine some illuminating special 
cases. 

36.2 Massless particles: left- and 
right-handed wave functions 

There is no unique way of writing the y M matrices and so we have a 
choice about how to represent them. One set of matrices that satisfies 


2 r> 

Equation 36.3 defines what is known 
as a Clifford algebra, named after 
William Kingdon Clifford (1845-1879). 
See Penrose, Chapter 11 for more de¬ 
tails. 


^Recall that p^ = \d^ = i(J^, V). 


^It does successfully eliminate the neg¬ 
ative probability problem. Of course, 
we’ve already found a way to deal with 
the negative energy states of the Klein- 
Gordon equation: Feynman’s prescrip¬ 
tion of interpreting them as positive en¬ 
ergy antiparticles, so the negative en¬ 
ergy states are not a deal-breaker. In 
fact, the Dirac equation will turn out 
to provide the equation of motion for 
a type of field whose excitations are 
fermions such as the electron. As such, 
the results from this temporary foray 
into wave mechanics should be treated 
as provisional. 
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This is sometimes called the Weyl 
representation. 


^If, at any point in the future, any 
of this notation seems overly compact, 
then it may sometimes be helpful to 
write things out in full. 


7 

The meaning of these subscripts will 
be explained shortly. 


the commutation relations is 


/ 

0 

0 

1 

0 ^ 


/ 

0 

0 

0 

1 \ 


0 

0 

0 

1 

1 


0 

0 

1 

0 


1 

0 

0 

0 

, 7 = 


0 

-1 

0 

0 

V 

0 

1 

0 

0 ) 


V 

-1 

0 

0 

0 / 


(36.8) 
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0 

0 
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/ 

0 

0 

1 

0 \ 

0 

0 

i 

0 



0 

0 

0 

-1 

0 

i 

0 

0 

J 7 3 = 


-1 

0 

0 

0 

V -i 

0 

0 

0 ) 


l 
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0 

0 / 


Writing the 7 ’s in this form is known as the chiral representation , 5 
for reasons that will become apparent. Writing out lots of four-by- 
four matrices is a tedious business, so to save on writing it’s useful to 
write these four-by-four matrices out as two-by-two matrices, where each 
element is itself a two-by-two matrix. 


Example 36.1 

There are several ways to simplify the notation. Defining I = 
that we can express eqn 36.8 in a 2 x 2 form: 

0 ’ 71 = ( -A 
?)■ 


7° = 


7 2 = 


0 

I 

0 

O' y 


O'x 

0 


0 


we notice 


(36.9) 


where the Pauli matrices have their usual definitions. 

It turns out that y 7 ' transforms as a four-vector. Motivated by this, we write 
7 m = (7°i 7 1 ) 7 2 j 7 3 ) = (7°) 7) and then 


7° = 


( 0 

I \ 


f 0 cr \ 


0 j 

.7=1 

[ -cr 0 ) 


where cr = (a 1 , cr 2 , <r 3 ). Finally, in a desire to write as little as possible, 
it 71 = (I, cr) and ct 7 * = (I, — cr ) to obtain 6 


0 


0 


(36.10) 
we define 

(36.11) 


Substituting these 7 ’s into the Dirac equation gives 

(loP° - 7 • P - = 0, (36.12) 


where p° = i<9o and p = —iV. We’ll make use of a handy two-by-two 
matrix form for the equations, where the four-component wave function 

^ L ^, where V’l and i/jr are, themselves, two-component 


tp is written 


•0R 


column matrices.' In this form the Dirac equation reads 


Y 0 

p° ^ 

\ ( 

' 0 

a • p 

A p° 

0 y 

1 l 

—cr • p 

0 


m 0 
0 m 



(36.13) 
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which can be rewritten as 

(po-cr-p)V>R = mip L , 

(po + cr • p)ipL = mipB.. (36.14) 

Now let’s further simplify things by considering the special case of a 
massless particle (which obviously has m = 0). We then obtain 

(p°-<r- p)ip r = 0, 

{p° + (r-p)ip L = 0, (36.15) 

which implies that a four-component eigenstate ip = ( ) splits into 

\ VR J 

two two-component pieces, i/’l an d ipn, that don’t get mixed up by the 
equations of motion. It’s as if the Dirac equation is telling us that 
we have two sorts of massless Dirac particles in Nature: left-handed 
particles described by wave functions ipL and right-handed particles 
described by wave functions t/’r. In the chiral representation, left-handed 
functions live in the upper two slots of the Dirac wave function and right- 
handed functions live in the lower slots. 


Example 36.2 

We need to be slightly more precise with our definition of left- and right-handed wave 
functions. To do this we define the chirality operator as 


7 5 = i7°7 1 7 2 7 3 


-I 

0 


(36.16) 


where the final equality holds in the chiral representation only. A wave function 


•0L 

0 

function 


is an eigenstate of 7 5 with eigenvalue —1. An entirely right-handed wave 
0 


is then an eigenstate with eigenvalue +1. The Dirac equation 

therefore predicts the existence of two types of Dirac wave function in the Universe: 
left-handed functions with chirality —1 and right-handed functions with chirality +1. 

If we want to extract the left- and right-handed parts of a wave function we may 
define projection operators via 

7 = 7^, 7 = ^, (36-17) 

which will project out the desired part. 


We conclude that since ipL and ipB are eigenstates of the massless Dirac 
equation, then a free and massless left-handed particle may never change 
into a right-handed particle. Moreover, using the fact that, for massless 
particles, the eigenvalue of p° is E p = \p\, our eqns 36.15 give us 

= iPr, (36.18) 

\P\ 

~yy^ L = ipi-i- 

\p\ 


(36.19) 
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8 Of course we know that this is a bad 
way of thinking about them since phys¬ 
ical antiparticles have positive energy. 
We apologize for the brief lapse into 
bad habits. 


This means that the massless states and ^l, defined as being chirality 
eigenstates, are also eigenstates of the helicity operator 

h=^T, (36.20) 

\P\ 

with eigenvalues +1 and —1 respectively. 

From this discussion it looks rather like chirality and helicity, while 
being represented by different operators, tell us the same thing. How¬ 
ever, one needs to be careful here: helicity and chirality are identical 
for massless particles, but are, in general, not the same. Clearly helic¬ 
ity tells us whether the particle’s spin and momentum are parallel or 
antiparallel. Helicity therefore depends on the frame in which it is mea¬ 
sured. In the case of massive particles it is possible to describe a particle 
with positive helicity and then to boost to a frame where the particle’s 
momentum is reversed but its spin is unchanged, thereby reversing its 
helicity. 

If we now consider massive particles again then the equations of mo¬ 
tion revert back to eqn 36.14. This shows us that massive Dirac particles 
involve both left- and right-handed wave functions coupled by the par¬ 
ticle’s mass. We can think of massive Dirac particles oscillating back 
and forth in time between left- and right-handed at a rate determined 
by their mass. This is most easily seen by looking at massive Dirac 
particles at rest, where we have idoipn = Tntfh and iSoV’L = mip r. 


Example 36.3 

We’ll find the dispersion relation for massive Dirac particles. Assuming that an d 
t/iR, are eigenstates of momentum and energy, we may eliminate from eqns. 36.14, 
to give us 

(p° + cr ■ p)(p° - er • p)ipR = m 2 i/> r. (36.21) 

Multiplying this out gives 

[(P°) 2 - P 2 } = m 2 , (36.22) 

which demonstrates that E p = =t(p 2 +m 2 ) 2 and we still have negative energy states 
as claimed above. Another way of saying this is that the eigenvalue of p° could be 
taken to be positive or negative and we still get the same dispersion. 


Finally we turn to antiparticle solutions of the Dirac equation. These are 
the solutions which appear to have negative values of energy. 8 Writing 
energies as E = — |po|> we find, for massless antiparticles, that 


Hp°| - v-PWr = 0 
Hp° 


fj. — 0 J an tip ai 'ticles, 


which implies 


= —i/’r 1 

^ „ > antiparticles. 

= V’L J 


(36.23) 


(36.24) 


This is to say that right-handed antiparticles have negative helicity and 
left-handed antiparticles have positive helicity. Note that this is the 
other way round compared to the particle case. 



36.3 


Dirac and Weyl spinors 327 


To summarize our progress so far. We’ve discovered that the four- 
component solutions to the Dirac wave function have left-handed parts 
in the upper two slots and right-handed parts in the lower two slots and 
can be written 

=( tU ) ■ <36 - 25) 

The four-component wave functions are known as Dirac spinors and 
are quite different to other objects that we have so far encountered 
such as scalars and four-vectors. The two-component objects iI>l(x) and 
iPr(x) are themselves known as Weyl spinors. Spinors are best defined 
in terms of their transformation properties, which we turn to in the next 
chapter. In the meantime we will examine them further as solutions to 
Dirac’s equation. 


36.3 Dirac and Weyl spinors 


We now investigate what’s hidden away in ipi,(x) and i/jr(x). Let’s work 
on the positive and negative energy solutions separately. Since physical 
particles and antiparticles really have positive energies, we’ll conform to 
convention and call these the positive and negative frequency solutions. 
The positive frequency solutions will describe Fermi particles, while the 
negative frequency solutions will describe Fermi antiparticles. It’s easy 
to show that a set of solutions to the Dirac equation may be written 


Particles: u(p)e lpx = ^ 

Xh(p) A i p. x 

vr Ip) ) 


Antiparticles: v(p)e lp ' x = 



(36.26) 


Note that the negative frequency solutions are defined such that p° = 
E > 0 so that antiparticles will have positive energies. The four- 
component objects 9 u(p) and v(p) are momentum space Dirac spinors 
and themselves satisfy the momentum space Dirac equation: 


(— m)u(p) = 0, 

{-p-m)v(p)= 0. (36.27) 

The left-handed [ufrp) and i'lCp)] and right-handed [ur(p) and z>r(p)] 
parts are momentum space Weyl spinors. 

We start by considering the particle solution at rest where p p = ( m , 0). 
In this case the Dirac eqn 36.27 becomes 

(T -” ){uX°\ )=°- < 36 - 28) 

with solutions 

“0“> s (£$) ) = ^ { ) • < 36 - 29 > 


Hermann Weyl (1885-1955) 


^ Just as a polarization vector e M (p) car¬ 
ries the spin information for the pho¬ 
ton, the spinors u(p) and v(p) carry the 
spin information of the fermion. 
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where £ = 


is a two-component column vector that we will choose 


u The factor y/rn is included in 
eqn 36.29 for later convenience. It re¬ 
sults in uu = 2m where u = is 

defined later (eqn 36.41). For mass¬ 
less particles one includes a factor yjE p 
instead. See Peskin and Schroeder, 
Chapter 3, for more details. 


to normalize 10 according to = 1. Note that £ is also often called a 
spinor. The argument may be repeated for antiparticle solutions with 
the result 


vr (p°) 


leading to 


v(p°) = 


(36.30) 


(36.31) 


where we write p = ^ P1 ^ for the antiparticle spinor. Notice that the 

solutions to the Dirac equation have two degrees of freedom. These are 
the two components of £ (for the positive frequency solutions) or p (for 
the negative frequency solutions). It turns out that it is £ and p that 
tell us about the spin of the particle. 

We know the form of the operator for the spin of a particle at rest 
is given by S = - \cr. The object on which this operator acts is p or 

A particle with spin up along the z-axis therefore has £ = m and 


one with spin down along the ^-axis has £ = ( ^ ). We can use these 

states as a basis to represent arbitrary spin states. 

What about antiparticles? We define an antiparticle at rest with 


physical spin up along the ^-direction to have rj = 


, so that S z p = 


— in. An antiparticle with physical spin down along z has S z = }, and 

ry = ^ J j so that S z ri = \r]. Admittedly this is rather confusing! The 

properties of particles and antiparticles are summarized in the table at 
the end of the chapter. 


Example 36.4 

A positive helicity antiparticle with momentum directed along the positive 2 -axis 
must have err) = + 77 . It must also, therefore, have 77 = ^ J ^, which corresponds to 
physical spin down. 


In eqn 36.32 we are again using the no¬ 
tation cr = (7, cr) and a = (7, — cr) and 
we understand that in taking the square 
root of a matrix we take the positive 
root of each eigenvalue. 


What about the spinor for a particle in the general case of particles that 
aren’t at rest? We quote the result here (and will justify it in the next 
chapter). For a particle or antiparticle with momentum p M the spinors 
are given by 


uiv] _ ( 
u[p) [ 


«(p)=( ^%p 

V -Vp^v 


(36.32) 
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Example 36.5 

We will evaluate the spinor for a particle with p^ = (E p , 0,0, |p|) and £ = 

Evaluating the left-handed part u^(p) = yjp • cr£ which lives in the upper 
of the spinor, we have 


/- ( E P ~ \P\ 

/p^=i p 0 


\p\ 


o y / ^ Ep ~ 


y/Ep + [pj 


0 

i y 

two slots 


0 
1 

(36.33) 


which gives u L (p) = s/p • = sj E p + ]p| ( ° ) . 

The analogous result for the right-handed, lower slots of u(p) follows by a similar 
argument with the result that uj^(p) = \/p • <r£ = \J E p — \p\ ^ ^ ^ . 

We obtain the final result for the spinor 


( 


u(p) = 


y/E P + |p| 
y/E p — |p| 


( 0 N 

\ 

V 1 


/ 0 


V 1 V 

/ 


(36.34) 


The general expression for the spinors in eqn 36.32 allows us to explore 
the helicity and chirality of particles and antiparticles in more detail in 
the next example. 


Example 36.6 

We will look at particles and antiparticles in the ultra-relativistic limit, obtained by 
giving the particles a big boost. Consider a particle with spin up along the 2 :-axis and 
momentum |p| along -\-z. This particle, shown in Fig. 36.1(a), has helicity h = 1, 
corresponding to physical spin parallel to the momentum. This is described by a 
spinor 

i 




y/ E P — pa 

y/Ep 


p CT 


( 1 N 

\ 

V 0 , 


( 1 


V 0 y 

/ 


This leads us to 


u(p) = 


( n ) ) 


y/Ep + |p| 


J 


big 

boost 


y/2Ep 


(»\ 

0 

Vo J 


(36.35) 


(36.36) 


which has ul = 0 and ur = yj2E p ^ J J. 

We can carry out the same analysis for a negative helicity particle [Fig. 36.1(b)], 
which has spin down along the z-axis and momentum along -\-z. This has a spinor 

/ o \ 

1 
0 

V 0 / 

(36.37) 


u{p) = 



boost 


y/ 2 E p 


which has ul = yj2E p ^ ^ ^ an< ^ = 



Fig. 36.1 Helicity states of particles 
and antiparticles, (a) A highly rela¬ 
tivistic electron with spin up along the 
2 :-axis and momentum along 2 :, which 
has h = 1. (b) The same with spin 

down, giving helicity h = — 1. (c) A 
highly relativistic positron with phys¬ 
ical spin up along the 2 :-axis and mo¬ 
mentum along 2 :, giving a helicity h = 
— 1 (d) The same with physical spin 
down along 2 : and h = 1. 
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Now for the antiparticle states. Again we take momentum to be alon^ -\-z. An 
antiparticle with physical spin up along the z-axis [Fig. 36.1(c)] has ^ ^ 


It has 


a helicity of +1 (opposite to what a particle would have) and has a spinor 


t 


v(p) = 


\/E — p- a 
-s/E + p ■ o 


\ 


J 


0 

#TR 

o 

V -v^Hpf J 


big 

boost 


y/2Ep 


( ° \ 

1 

0 

V o J 

(36.38) 


This has v^, = y/2E p ( ^ ) and = 0. 

Finally, an antiparticle with spin down along the 2 -axis and momentum along +z 
[Fig. 36.1(d)] has helicity —1 and a spinor 

( \ 

o 


t 


v(p) = 


V 


y/E — p - cr 
— y/E + p <7 


\ 


J 


-VeTW\ , 
o J 


big 

boost 


— \J 2E P 


( 0 \ 

0 

v o j 

(36.39) 


which has = 0 and ur = — yj2 E p ^ J J . 

Notice that highly relativistic particles with positive helicity have right-handed 
spinors, while relativistic antiparticles with positive helicity have left-handed spinors. 


Neutrinos only exist as left-handed 
particles. They are (at least approx¬ 
imately) massless, which means that 
only negative helicity neutrinos and 
positive helicity antineutrinos have ever 
been observed. 



Fig. 36.2 Parity violation in the weak 
interaction. The electron oscillates be¬ 
tween left- and right-handed chiralities. 
It may only emit a W~ particle when 
it is left-handed. 


12 


Imagine that there was an interaction that only coupled to left-handed 
particles. In the ultra-relativistic limit, where chirality and helicity be¬ 
come identical, we’ve demonstrated that only negative helicity particles 
and positive helicity antiparticles would interact. Amazingly, this is in¬ 
deed the case for the weak interaction, which describes a field W fl (x) 
which only couples to left-handed Dirac particles! 11 This means that 
only a left-handed fermion may emit a particle. For a massive elec¬ 
tron, which oscillates between being left- and right-handed, it may only 
emit a W~ particle (turning into a neutrino as it does so) when it is 
left-handed as shown in Fig. 36.2. 

To summarize this section: the information content of a spinor tells us 
about the spin of a particle through the object £ (or rj for an antiparticle) 
which has two degrees of freedom. If we know £ and the momentum of 
the particle then all four components of the spinor (two left-handed parts 
and two right-handed parts) are completely determined. 


36.4 Basis states for superpositions 

We turn to the orthogonality of the Dirac spinors. This property is useful 
if we are to use the spinors as a basis in order to express arbitrary wave 
functions as linear superpositions which contain spinor parts. Spinors 
carry information about the spin of the particle, so we want one spinor 
to be orthogonal to another if they describe different spin states. (The 
orthogonality of momentum states will be handled, as usual, by the 
e“i p x factors.) 

Now, it’s straightforward to show 12 that 


See Exercise 36.3. 


v){p)u{p) = 2 E p tf€, v^(p)v(p) = 2E p rfrj. 


(36.40) 
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However this isn’t going to be very useful. The reason is that these 
expressions are not Lorentz invariant: the combinations ivu and v'v 
are proportional to E pi which is frame-dependent. To make a Lorentz 
invariant quantity we define the adjoint or bar of a spinor as 

u{jp) = v) (p) 7 0 . (36.41) 

This is useful as a Lorentz invariant quantity can now be formed via 

u(p)u(p ) = 2 (36.42) 

Now for the orthogonality: let’s also choose a basis in which to describe 
the spin of a particle. We usually choose the basis = ( n 'l and 


£ 2 = ^ ^ such that = S sr . The result of all of this is that we 

can now write an expression for a Dirac spinor which is suitable for use 
as a basis state as 

u s (p) = ( ) , (36.43) 


where s = 1 or 2 and we have 


s {p)u r {p) = 2 = 2 mS sr . 


(36.44) 


We are now able to express an arbitrary particle solution of the Dirac 
equation as an integral over momentum and a sum over spin states as 
follows: 


= / ,f(, , * 

J ( 2 tt )2 (2E p )2 ^ 


(36.45) 


where a sp is an amplitude here. 

In the case of the antiparticle spinor we pick up a minus sign: 

v s {p)v r (p) = —2 = —2m6 sr , 

and the antiparticle wave function is written 

, + , x /' d 3 p 1 \ ' Z,* f„\Avx 


(36.46) 


V’ + (^)= / T^TsT^WTT Y, b sp vS W eiP ' X ’ ( 36 ‘ 47 ) 
J ( 2 ir) 2 (2E p )2 ^ 

where the components have amplitude b* sp . Finally we note that the ids 
and v ’s are orthogonal: 


u r (p)v s (p) = v r {p)u s {p) = 0 . 


(36.48) 


We summarize all of our definitions in the master table below. 
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Example 36.7 

One of the most useful expressions in subsequent calculations is the sum 


2 


E uS (p) uS (p ) 


E 


Vp ^ 
Vp ■ a 


e + Vp ^ ) ■ 


(36.49) 


So why care about these spin sums? 
We’ll see a little later that when we ac¬ 
tually descend from our ivory tower to 
calculate something that people actu¬ 
ally measure a certain amount of av¬ 
eraging is going to be required. This 
is because the incoming and outgoing 
polarizations of our particles are usu¬ 
ally unknown. In that case the usual 
instruction is to average over initial 
states and sum over final states. This 
will lead to sums over the spin states 
where relaxations like u s (p)u s (p) = 
if + m come in very useful. 


We use the fact that 
2 


»)+(;)(»■>»-(: °) 


giving us a four-by-four matrix 
2 

E“ s ( p )“ s ( p ) = 


Finally we use our 7 matrices to write this four-by-four object as 
y ' u s ( p)u a ( p ) = 7 ■ p + m. 


(36.50) 


. (36.51) 


(36.52) 


It’s left as an exercise to prove the analogous expression for antiparticles 

v a (p)v s (p) = 7 ■ p — m. (36.53) 


36.5 The non-relativistic limit of the Dirac 
equation 

It is often stated that the Dirac equation is required to describe quantum 
mechanical spin. This isn’t quite true as there’s another equation that 
does this job perfectly well at low energy: the Pauli equation given by 

Hip = ^fj^-ip. (36.54) 

2 m 

The Pauli equation emerges naturally as the non-relativistic limit of the 
Dirac equation. 


Example 36.8 


As in Chapter 12, we extract the non-relativistic limit by factorizing out the large 
mass contribution through a wave function ?/>l(£, a;) = c/)i^(t,x)e~ irnt . From Dirac’s 
equation (ft — m)ijj = 0 we obtain 


( ~m E-ct p \ / 0 L (f, £c)e lmt \ 

V E + crp —m J V x)e~ ,rnt ) 


(36.55) 


where E = ido- Noting that Etj> a (t, x)e lmt = e lmt (m + x ) (where a = L 

or R) we obtain 


= 0 , 

= 0. (36.56) 


m0 L + (m + E - <r • p)</> R 
(m. + E + cr- p)b L - m(j > R 
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Rewriting the second of these equations we produce 


4> R 



allowing us to eliminate <Pit from the first equation, so that 



(<T • p ) 2 


4 > l = o. 


(36.57) 


(36.58) 


Recognizing that the E 2 term may be neglected in comparison with the other terms 
at low energy, we end up with Pauli’s equation 

&<$> L = (<T ; P)2 0l. (36.59) 

2m 

o 13 

The same equation applies for </>r too, as can easily be shown. 13 See Exercise 36.8. 


Example 36.9 

The previous example gives us enough ammunition to show that the Pauli equation 
(and by extension, the Dirac equation) implies that the ^-factor of the electron is 
g = 2. We may include the electromagnetic interaction in the Pauli equation by 
making the replacements 14 p —> p — qA and E —> E — qA °. The result is 

(E - qA°)4, = ^ ^ ~ qA )] <f>. (36.60) 


^This is equivalent to the replacement 
d^i —> that we examined in Chap¬ 

ter 14. 


Multiplying out the right-hand side and using the identity cPcr 7 = 5 l i + \e' l ^ k cj k we 
find 15 

{E - qA°)<t> = qA ' > <j>- -£-&■ B<j>, (36.61) 

2m 2m 

where we have identified B = V x A. The expected interaction Hamiltonian of a 
magnetic moment with an external 23-field may be written 


15 

This is made slightly tricky by the 
fact that p and A don’t commute. If 
in doubt, write p = — iV and remem¬ 
ber that the differential operator acts 
on everything to its right. 


H = —jj, B = -g — S■ B. 

2m 


(36.62) 


Comparing this with ■ cr from eqn 36.61 and identifying the spin operator as 

S = we conclude that g = 2. 

The prediction that g = 2 (rather than g = 1) was historically an important one 
in confirming the validity of the Dirac equation. However, it is found experimentally 
that g = 2.002319304.... The reason for this discrepancy is that it is not appropriate 
to treat Dirac’s equation as the equation of motion for a single particle as we have 
done here. In fact, the Dirac equation is correctly interpreted as providing the equa¬ 
tion of motion for fermionic quantum fields. As we shall see in Chapter 41 this will 
lead to a prediction that agrees with experiment to a truly remarkable extent. 


In this chapter we’ve encountered a new object, the spinor, to add to 
the scalar and vector objects that we’ve considered before. We have, 
however, treated the Dirac equation as a single-particle equation. We 
know that in reality particles are excitations in quantum fields. In order 
to be able to define spinor fields we need to know the all-important 
transformation properties of these spinor fields. In the next chapter we 
turn to the Lorentz transformation properties of these objects. 
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Chapter summary 

• The Dirac equation (p — irijif = 0 results from an attempt to take 
the square root of the Klein-Gordon equation. The four compo¬ 
nents are required to represent spin and reflect symmetry under 
the parity operation. 

• The operator ‘p-slash’ is f = 7 ^( 19 ^) where 7 M are the gamma 

/ 0 cr^ \ 

matrices. In the chiral representation = I q ) where 

cd 1 = (/, <r), = (/, -er) and 7 s = i7°7 1 7 2 7 3 . 

• Massless solutions of the Dirac equation are also the eigenstates of 
the chirality operator, and are known as left- and right-handed so¬ 
lutions. Left- or right-handed states with mass are not eigenstates 
of the Dirac equation because the mass term couples them. We 
can think of Dirac particles as oscillating in time between being 
left- and right-handed. 

• There are positive and negative energy eigenstates of the Dirac 
equation. These give rise to particles and antiparticles. 

• The non-relativistic limit of the Dirac equation yields the Pauli 
equation and predicts g = 2 . 


Exercises 


(36.1) An illustration of the reason for anticommutation 
and spin 

(a) Show that the Dirac equation can be recast in 
the form 

(36.63) 

where Ho = a ■ p + f3m and find a and /3 in terms 
of the 7 matrices. 

(b) Evaluate Ho and show that for a Klein-Gordon 
dispersion to result we must have: 

(i) that the a* and (5 objects all anticommute with 
each other; and 

(h) (cd ) 2 = (d ) 2 = 1. 

This provides some justification for the anticommu¬ 
tation relations we imposed on the 7 s. 

(c) Prove the following commutation relations 

(i) j H, L l J = i (p x a.) 1 where L = x x p. 

(ii) = —i(p x a) 1 where S = and we 

define S = ^7 x 7 . 


This tells us that the Dirac Hamiltonian doesn't 
commute with the orbital angular momentum op¬ 
erator L, mandating the existence of another form 
of angular momentum (i.e. the spin S) so that H 
commutes with the sum J = L + S. 

(36.2) (a) Using the facts that 7 lt = — 7 * and 7 0t = 7 0 , 
take the Hermitian conjugate of the Dirac equation 
to show that we obtain the non-covariant form: 

—ido'!/ ,t 7° + idi'0^7* — — 0. (36.64) 

(b) Show that to restore covariance we may multi¬ 
ply through by 7 0 which results in 

+ mip = 0. (36.65) 

(36.3) (a) Verify eqn 36.40. 

(b) Verify eqn 36.42. 

(c) Verify eqn 36.46. 

(36.4) The 7 M matrices and spinors don’t just have one 
form. Any form that obeys the anticommutation 
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relations will also work. Consider, for example, the 
unitary transformation matrix 


(c) (7 5 ) f =7 5 . 

(d) ( 7 5 ) 2 = 1 . 




(e) ( 7 V) f = 7°7V7°. 
( 36 - 66 ) (36.7) (a) Show that 


Use this to transform the Dirac wave functions 
•tjj —> Uip and show that for the resulting wave func¬ 
tion to satisfy the Dirac equation the 7 matrices 
have the explicit form 



0 cP 
-o i 0 


(36.67) 


This is known as the standard or Dirac- Pauli rep¬ 
resentation. 

(36.5) Show that in (l+l)-dimensional spacetime, the re¬ 
quirement { 7 m , 7 1/ } = is satisfied by 7 0 = o 2 
and 7 1 = icr 1 . 

(36.6) Prove the following identities: 

(a) { 7 ^ 1 7 5 } = 0. (36.8) 

(b) ( 7 ^ = 7°7 m 7°- 


m(p , )<7 MI '(p' — p) v u(p) (36.68) 

= ^u(p) [~7 M (/- m) + {m-f) 7 m ] u(p), 

where a' 11 ' = | [ 7 ^, 7 "] • 

(b) Show 7 ^ = 2 p^ — 

(c) Use these two results to prove the Gordon 
identity 

1 av 

u(p)^u(p) =u(p) ——- V — " u(p), 

(36.69) 

where q M = (p' — p) M . This identity will be used in 
Chapter 41. 

Take the non-relativistic limit of the Dirac equation 
and find the equation of motion for 
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^ The Wisconsin State Journal's humor¬ 
ous writer ‘Roundy’ began his article as 
follows: I’ve been hearing about a fel¬ 
low they have up at the U. this spring 
- a mathematical physicist, or some¬ 
thing, they call him - who is pushing 
Sir Isaac Newton, Einstein and all the 
others off the front page. So I thought 
I better go up and interview him for 
the benefit of the State Journal readers, 
same as I do all the other top notchers. 
The article seems almost too good to be 
true, and unfortunately it seems that 
it is in fact a fake. Joseph Coughlin 
(nicknamed Roundy because of his ro¬ 
tund appearance) certainly existed but, 
as detailed in Graham Farmelo’s biog¬ 
raphy of Dirac The Strangest Man, the 
article is most probably a spoof. Dirac 
however kept a copy of it. 


2 • 
Recall that we define tan 4> l = (3 X . 


Q 

The commutation relations are 

.)',./'] = i e ijk J k , 

[J\K j ] = i e ijk K k , 

[K\Ki] = -i e ijk J k . 


How to transform a spinor 


‘Do you ever run across a fellow that even you can’t under¬ 
stand?’ 

‘Yes, ’ says he. 

‘This will make a great reading for the boys down at the of¬ 
fice, ’ says I. ‘Do you mind releasing to me who he is?’ 

‘Weyl, ’ says he. 

The interview came to a sudden end just then, for the doctor 
pulled out his watch and I dodged and jumped for the door. 

But he let loose a smile as we parted and I knew that all the 
time he had been talking to me he was solving some problem 
that no one else could touch. But if that fellow Professor 
Weyl ever lectures in this town again I sure am going to take 
a try at understanding him! A fellow ought to test his intel¬ 
ligence once in a while. 

Roundy interviews Professor Dirac (1929), Wisconsin State 
Journal 1 

The humble electron cannot be described by either the scalar or vector 
field theories that we have discussed thus far. The electron, and indeed 
all spin-i fermions, are excitations in a spinor field. The different 
sorts of field are classified by how they transform under rotations and 
Lorentz transformations. We know how scalars transform: by definition 
they don’t change. We also know how to Lorentz transform and rotate 
vectors. In this chapter we discuss how to transform a spinor. 


37.1 Spinors aren’t vectors 

Recall that a general Lorentz transformation and rotation can be written 
as a single matrix: 

D{0,<j>) = (37.1) 

where 6 gives angle of rotation and <fi gives the velocity boost. 2 This 
general transformation may be applied to scalars, vectors or tensors by 
picking the appropriate form of the rotation generator J and boost gen¬ 
erator K, subject to their all-important commutation relations. 3 We’ve 
seen how rotations and boosts affect scalars and vectors, but how do 
they affect spinors? We have a problem in that our previous explicit 
expressions for K are only geared for vectors, which are objects whose 
rotations are described by the group SO( 3). We will see that spinors 
are rotated by the elements of a different group. 
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37.2 Rotating spinors 


As Eugene Wigner proved, the rotations of spinors are generated by the 
elements of the group SU( 2). This is the group of two-by-two unitary 
matrices with unit determinant. The generators are the Pauli spin matri¬ 
ces whose commutation relations are [cr 1 , <r J ] = 2i £^ k a k . This comes as 
no surprise since all of us learn at our mother’s knee that two-component 
S' = 1/2 spin states are acted on by an angular momentum operator 
J = " and the angular momentum operator is the generator of rota¬ 
tions. The set of matrices that rotates two-component Weyl spinors is 
therefore 

D(6) = e - i°" e . (37.2) 

These operators rotate the left-handed Weyl spinors tpi,(p) and the 
right-handed Weyl spinors ^r(p) in exactly the same way, so the four- 
component Dirac spinor is rotated by the matrix 


( \ ^ ( e ' 2<r ' 0 0 ^ 

\ V’r, / V 0 e~^ e J \ t/j R J ' 


(37.3) 


Example 37.1 

With this knowledge we can calculate the 2x2 matrix that rotates Weyl spinors 
around a particular axis. We’ll try the rotation by an angle 0 1 about the ^-direction, 
whose matrix is 4 ^Recall eqn 15.23. 

ii 6 1 9 1 

D{8 1 ) = e~ 2 a 6 =/ cosy - id- 1 sin—. (37.4) 

To rotate four-component Dirac spinors we simply act on both the left- and right- 
handed spinors with this 2 x 2 matrix. 


37.3 Boosting spinors 

Next we examine how to boost spinors by finding an explicit form for 
the boost generators K. Since we know that spinors are rotated by the 
Pauli matrices via J = \cr and we know the commutation relations of 
the generators K with themselves and J, we guess that K oc er. We 
find that two possible choices of proportionality constants satisfy the 
commutation relations. It is easily checked that both 

K = ±] J (37.5) 

will work. It seems, therefore, that there must be two sorts of spinors: 
those boosted by K = + and those boosted by K = — and the two 
choices of sign correspond to two distinct representations of the group. 
At rest, these two sorts of spinors should be indistinguishable. This is 
indeed correct and the two sorts of spinors are exactly the left- and right- 
handed Weyl spinors, l and -!/)r, that fell out of the Dirac equation in 
the previous chapter. 
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Example 37.2 


To show that the Weyl spinors are transformed by different representations of the 
group we should be able to show that each representation obeys the Lie algebra of 
the generators of 51/(2), which is = i €^ k C k . This is possible if we define 

J±iK 

J± = -. 

2 

(37.6) 

since then the commutation relations read 


[j l +,ji] = i£ ijk J +. 

, (37.7) 

[jL,JL\ = i£ ijk J- 

(37.8) 


Our new generators J+ and J _ now have independent Lie algebras and each sep¬ 
arately generates the rotations and boosts for a type of Weyl spinor. The simplest 
non-trivial case of this algebra corresponds to one or other of the new generators 
being zero, which gives the rules 


J = —i K for J+ = 0, 
J = i K for J— = 0. 


(37.9) 


We then have generators J = <x/2, K = ier/2 for the Weyl spinor and generators 
J = o"/2^K = — ier/2 for the Weyl spinor -0R, leading to our final answer for the 
transformations appropriate for Dirac spinors: 

/ d^o,4>) o \ / 

4>n ) V 0 (0. <W ) \ ipR 

where 


(37.10) 


d l (o,4>) 

Dr (6,4,) 


e f-(-i 0 +<2>). 


(37.11) 


After all of this formalism, we’re left with the answer to our question of 
how to Lorentz boost a Dirac spinor. The Dirac wave function, which is 
made up of two spinors stacked on top of each other, is boosted according 
to if) —> e lK ^4}, or 


( V>l \ ( 0 \U\ 

V i’R ) V 0 )' 


(37.12) 


Example 37.3 

We can get some practice by boosting the spinors along the ^-direction. 


(in. ( e-h * 3 * 3 

l / V o 


0 


bL A 

V’R J 


The elements of the boost matrix are given by 

I 1 3 ,3 (J^dP 

e ± 5 a 0 = 1± _T_ + 


a 3 <f> 2 


±... 


= /cosh(0 3 /2) ± cr 3 sinh(</< 3 /2). 


So the boost is 

V’R J 


I cosh ^-o' 3 sinh A- 


0 

I cosh ^- + o' 3 sinh A- 


(37.13) 


(37.14) 


V>l A 
V>R J 


o 


(37.15) 
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Reverting to 4 X 4 matrices we see that the components are boosted by a matrix 
/ 

/ e 2 0 0 

0 e 2 0 


V 


\ 


0 


(37.16) 


e 2 / 


Because e± x = cosh x zb sinh x we can write = yj cosh </> 3 =L sinh Identifying 

cosh </> 3 = 7 = E/m and sinh (/> 3 = (3 Z 7 = \p\/m we see that a boost along z is 
enacted by a matrix 


( Ve^]p\ 
0 
0 

V 0 


0 




0 

0 

v^+lpf 

0 


\ 


(37.17) 




We found previously that the Dirac wave function for a particle at rest is given by 


U L 

UR 


= \Jm 

z-direction has the following effect 

( 


. Putting everything together we deduce that a boost in the 


S' 

V u t J 


= sfm 


( e \ 
e 
e 

\e j 


t \ 

VE+jp\ e 

VE+\p\ e 

V v e ~ ipi ? 2 / 


This provides some justification for the expression 

u(p ) = 

as we claimed in the previous chapter. 


(37.18) 


(37.19) 


37.4 Why are there four components in 
the Dirac equation? 

The Dirac equation is written in terms of a four-component Dirac spinor 
ip. These Dirac spinors have two degrees of freedom: the two components 
of £ (or rf). So why do we have four components in the Dirac equation? 

We found that for the case of massless particles the Dirac spinors 
fell apart into two: left-handed Weyl spinors and right-handed Weyl 
spinors. Each of these spinors carried the same spin information in £ (or 
rf). The question is, then, why does the world need left- and right-handed 
fields. We will answer this question by asking what fundamental opera¬ 
tion turns left-handed spinors into right-handed spinors? The answer is 
parity. 

Many of the properties of fermions are invariant with respect to parity 
and so we require that Dirac particles have the same properties in a 
parity reversed world. (The weak interaction violates parity, but we 
ignore that for now.) The parity operation P does the following: it 
maps x —r —x and v —> — v and so it results in the changes to the 
generators: 


P-^P = -K, P" 1 JP = +J 


(37.20) 
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(recall that we say that if is a vector and J is a pseudovector). The 
parity operation therefore turns a left-handed spinor into a right-handed 
spinor 

Pf/'L —> "0R, PV’R —► V'l- (37.21) 


An explicit form for the parity operator, suitable for use on Dirac spinors, 
0 1 
I 0 


is P = 7 ° = 


As we have stressed, the Dirac wave functions are not actually single¬ 
particle wave functions: they represent fields. In the next chapter we 
will quantize these fields to make them operator-valued, so that when we 
input a position in spacetime we output a field operator. Dirac particles 
are excitations in these quantum fields. 


Chapter summary 

• We have derived the transformations for rotating and boosting 
spinors. 

• Parity, enacted with the 7 0 matrix, swaps left- and right-handed 
parts of the wave function. 


Exercises 

(37.1) (a) Verify eqn 37.4. 

(b) Show that the rotation matrix 

D{9 3 ) =e~^ e3 = ( ® 2 ° 3 V (37.22) 

V 0 eT/ 

(37.2) Show that both choices of sign K = obey the 
necessary commutation relations. 


(37.3) A one-line derivation of the Dirac equation. 

(a) Given that the left- and right-handed parts of 
the Dirac spinor for a fermion at rest are identical, 
explain why we may write ( 7 0 — l)rt(p°) = 0 . 

(b) Prove that e lif '* 7 °e _lif = fl/m. 

(c) Use the result proven in (b) to boost 

( 7 0 - l)it(p°) = 0 , 

and show that you recover the Dirac equation. 
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After examining the Dirac equation as an equation of motion for single 
particles and investigating the transformation properties of the spinor 
solutions we now turn to quantum fields that obey Dirac’s equation as 
their equation of motion. We will approach the problem with a variety 
of the weapons we have developed in previous chapters. We start with 
canonical quantization. We will write down a Lagrangian density whose 
equations of motion are the Dirac equation. We will then quantize using 
the rules. The quantized field that we will produce is of fundamental 
importance since its excitations describe fermions such as the quarks 
and leptons of particle physics and various excitations in solids such as 
quasielectrons in metals. In order to carry out the sort of calculations 
that can make contact with experiments, such as scattering, we will need 
a propagator and Feynman rules. We will obtain the propagator with the 
path integral. Finally the properties of Noether’s current will motivate 
us to write down the Lagrangian for perhaps the most successful of all 
quantum field theories: quantum electrodynamics. 


38.1 Canonical quantization and 


Noether current 341 

38.2 The fermion propagator 343 

38.3 Feynman rules and scatter¬ 
ing 345 

38.4 Local symmetry and a gauge 

theory for fermions 346 

Chapter summary 347 
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38.1 Canonical quantization and Noether 
current 

To quantize a field we start by writing down a Lagrangian density de¬ 
scribing classical fields. It seems rather wrong-headed to write down a 
classical equation for a field that describes fermions. Although we can 
make a model of bosonic excitations like phonons out of masses and 
springs there is no such obvious classical analogue for fermions. Un¬ 
daunted, we will write down the Lagrangian by choosing a form that 
when inserted into the Euler-Lagrange equation outputs Dirac’s equa¬ 
tion. The Lagrangian that leads to the Dirac equation is given by 

C = ip{ i0 — m)ip = ip( i7 M 5 M — m)ip , (38.1) 

where ip = ip^ 0 . Just as free scalar field theory describes fields whose 
equation of motion is the Klein-Gordon equation, this Lagrangian de¬ 
scribes fermion matter fields whose equation of motion is the Dirac equa¬ 
tion. 

We can use this Lagrangian to canonically quantize the Dirac field. 
This field has two components, just like the complex scalar field. 
The momentum conjugate to ip is easily calculated using the ordinary 
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method. We find that 

(38 - 2) 

and Ihi = 0. We obtain a momentum field 

n°=ty7° = tyt. (38.3) 

The Hamiltonian follows in the usual fashion from TL = n 3 SoV’ — £, 
yielding 

TL = ip\— i 7°7 • V + my 0 )?/’- (38.4) 

However, we also know another piece of information, namely the Dirac 
equation itself which simplifies things here. The Dirac equation tells us 
that i 7 0 do?/’ = (—ry- VThis allows us to rewrite the Hamiltonian 
as 

TL = -i/^iSo ip- (38.5) 

This looks much simpler! Next we make the fields quantum mechanical 
1 We also put hats on if and if t to make by imposing equal-time anticommutation relations: 1 
the point that they are now operators. A . 

{Mt,x)^l(t,y)}=S^(x~y)S ab , (38.6) 

{i>a(t,x),4> b (t,y)} = {i>l(t,x),i>l(t,y)} = 0, (38.7) 


where a and b label the spinor components of the four-component field 
operators i/>( x ). The mode expansions of these quantum fields are given 

by 


ip(x) 


d 'X 1 r E(^(P) 

(27t)I(2 E p )if^y 


*~ lp ' x ±n s {p)b\ e ip ' x 


a sp e p ' x + v 


d 3 p 1 


(2tt)2 ( 2 E p )2 ^ 


)E(“ s w 


dtpjP-* +v s (p)b sp e~ ipx ) . (38.8) 


The creation and annihilation operators must themselves obey the anti¬ 
commutation relations 


{a sp ,a(,q} = {b sp ,bl q } = 6 {3 \p- q)S sr . (38.9) 

Now it simply remains to insert the mode expansion into the Hamilto¬ 
nian and normal order to obtain a quantized field theory. 


Example 38.1 

We obtain 

H = 


/ 


d^a:- 


d 3 pd 3 <j 


- £ (u s (p)at p e ip * + v s (p)b sp e- ip *) 


(27r) 3 (2Ep2E,)3 

X7°E, (u r ( 5 )a r ,e- , «-“ -w r ( 5 )8t,e i «" e ) 

= f ( uS Hp)atp uS (p)asp-v s Hp)b sp v s (p)bt p y (38.10) 

s,r J 

Lastly, we can use the properties of the normalized spinors to do the spin sum, since 
we know that u s ^u r = 2 E p 5 sr . 
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The (normal ordered) Hamiltonian becomes 
, 2 

H = / d 3 p^2E p (al p a sp + bl p b sp ^j . (38.11) 

S = 1 

We see that the energy is given by the sum of the energies of the particles 
and antiparticles, exactly as we should expect. 

The Lagrangian also has a global U( 1) symmetry, meaning that the 
Lagrangian is invariant with respect to the change ip{x) —> ip(x)e' a , so 
we can use Noether’s theorem to find the conserved current and charge. 


Example 38.2 

With ip —> ipe ul we also have 7 —> ipe~ Since the transformation is global we also 
can write —> e ioc d p ip. Putting this all together we get 

C = ip(ipi — m)ip 

—» b e_1 “(i^ — m)ipe la = ipfif) — m)ip. (38.12) 


The use of Noether’s theorem goes through similarly to the case for the 
complex scalar field, except that we have 

% = fof, = 0, (38.13) 


and Dip = i ip. So, upon normal ordering and swapping signs to give 
the conventional number current direction, we obtain a Noether current 
operator 

% c = (38.14) 

We can use this to work out the conserved 17(1) charge which, upon 
substitution of the mode expansion, yields up 


Qnc = [ d3 P^2 ( at p a sp - b\ p b sp ) . (38.15) 

The conserved charge is given by the number of particles minus the 
number of antiparticles, as expected. Once again, this is just like the 
complex scalar field case. 

38.2 The fermion propagator 


Noether’s theorem: U( 1) internal 
symmetry 

Di(j = i-il) 

Db f = -i 

% = 

up = 0 

ip 

DC = 0 

= 0 

, 4 0 = 

J Nc = 

Qnc = / d 3 ?E 

2 (n (a) -n (b) ) 

s = l \ n, sp "'sp J 


The propagator for fermions can be worked out by calculating Gq(x, y ) = 
(0\Tip(x)ip(y)\0). However, we can also use the path integral. For 
fermions we treat the classical fields ip and ip that appear in the path 
integral as Grassmann numbers . 2 In terms of the fields ip and ip, the 
generating functional is written 


See Exercise 38.4. 

o 

See Chapter 28 for an introduction to 
Grassmann numbers. 


^[ 77 , 77 ] = / V'lpUl/je 


i f d 4 x 'ip(x)(j.0-mj'ip(x)-\-r}(x) , iJj(x)-\- , iJj(x)r)(x) 


(38.16) 
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where we notice that both and ip{x) must be coupled to separate 
source fields fj(x) and rj(x) respectively. The leg-work for this calculation 
was carried out in Chapter 28 and so we note that the result is 

Z[fj,rj\ = Ce~^ dixd4v ^ x) ^- m) ~ 1 ^ v) , (38.17) 

where C is proportional to a determinant. As usual, we normalize by di¬ 
viding through by Z[ 0,0] to obtain the normalized generating functional 


Z[fj,rj = e~ J d ‘ l ‘ xdiv v(x)iS(x- y ) r i( y )^ 


(38.18) 


where S(x — y) = (i0 — to) - 1 . The propagator is then the solution of 
the equation 

(i0 — m)iS(x) = k>G)(x). (38.19) 

Note that there is a difference of a mi- The solution for the fermion propagator may be read off from this 3 as 
nus sign in the equations defining the 

scalar and fermion propagators in terms f d 4 p i e -w<*-v) 

of the Green’s function of the equations Go(x,y) —i S(x — y ) — j 77 ^ 7 ^ 4 ^-~—j—— • (38.20) 

of motion. 


(27r) 4 p — to + ie 


(38.20) 


It’s important to note that the fermion propagator Go(x,y) is a four- 
by-four matrix. This is revealed in momentum space if we multiply top 
and bottom by (ti + to) to give 


Go (p) = 


i i(p + to) 

I — to p 2 — to 2 + ie 


p 2 — to 2 + ie \ p + pa 


p” — p ■ a 


(38.21) 



Fig. 38.1 The fermion propagator in¬ 
terpreted in terms of left- and right- 
handed spinors. 


where the latter equation is given in chiral representation. An interpre¬ 
tation of the four-by-four fermion propagator is revealed if we consider 
the roles of the left- and right-handed parts of the Dirac spinor. The 
fermion propagator G o is formed from the combination il)(x)ip(y) which, 


after substituting ^ 


i’R 


has the form 


/ V>l( 2 #r( 2/) V’daOV’iXy) 

V 0r(x)( x)ipl(y) J ' 


The top-left slot deals with a right-handed part entering and a left- 
handed part leaving. In contrast, the top-right slots tells us about 
a left-handed part entering and a left-handed part leaving. Recalling 
that a particle propagator is formed from a series G = Gq + GqVGq + 
GqVGqVGq + GoVGqVGqVGo + ■ ■., we may multiply the propagator 
matrices and obtain the matrix represented in Fig. 38.1. We see that 
the top-right slot still describes a left-handed part entering and a left- 
handed part leaving, but now expresses this as a superposition of all of 
the possible oscillations between left- and right-handed parts that result 
in the left-handed part leaving. The other slots of the matrix represent 
the other combinations of chiralities entering and leaving. In this sense 
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we may think of a massive Dirac particle propagating while oscillating 
between left- and right-handed. 

With this in mind, our problem is solved since we have found the 
propagator for free fermions. This enables us to start solving scattering 
problems through the use of Feynman diagrams and it is the Feynman 
rules for fermions that we turn to next. 


38.3 Feynman rules and scattering 

The generating functional Z[fj, rj\ can be used to find the Feynman rules 
for fermions. We won’t do this, but simply list the rules in momentum 
space (see also Fig. 38.2). 

Feynman rules for fermions 

• A factor i /(ft — m + ie) for each internal fermion line. 

• A factor u s (p) for an incoming fermion with momentum p and 
spin s. 

• A factor V s (p) for an incoming antifermion with momentum p 
and spin s. 

• A u s (p) for an outgoing fermion with momentum p and spin s. 

• A V s (p) for an outgoing fermion with momentum p and spin s. 

• Trace over the matrix product arising from a fermion loop. 

• Add minus sign factors reflecting fermion statistics, the most 
common of which is a factor —1 for every fermion loop. 

We will now get some practice with the use of these rules. 


>- > - < 


(b) 


V' 


p — m + ie 


P 


A - en S 


u s (p ) 


(c) y 

P f.' =v S (p) 


(d) 


P 


= u s {p) 


(e) 


P\ 




= v s (p) 


Fig. 38.2 Feynman rules for fermions. 


Example 38.3 

Let’s use our rules to work out some Feynman diagrams for a theory of scalar phions 
interacting with electrons, via the interaction C\ = This is very much like 

the example of the complex scalar field interaction in Chapter 11, except that we 
need to consider the complication of the spinor polarizations. 

We can start with electron-electron scattering mediated by the ^-channel exchange 
of a virtual phion with mass m^, as shown in Fig. 38.3(a). Applying the rules, we 
obtain an amplitude 

i Mi = (-i g) 2 u s '(p')u s (j>) — -^- ^u r '(k')u r (k), (38.23) 

(p - pr - 

where we have ignored an overall energy-momentum conserving delta function. Next 
we consider electron-electron u-channel exchange scattering shown in Fig 38.3(b). 
Swapping the lines over corresponds to swapping operators in the 5-matrix element, 
so this diagram comes with an extra minus sign. Applying the rules, we obtain 

IM 2 = -(-i fl )WK(p) , , 1 - r u«'(p> r (fc). (38.24) 

(p — K) — 

We’ll get a little further by considering the non-relativistic limit for distinguish¬ 
able particles. This means that only the ^-channel diagram contributes. For non- 
relativistic particles, we have the approximate properties: 

p«(m e ,p) p'«(m e ,p') fc«(m e ,fc) k'tt(m e ,k') 



Fig. 38.3 (a) Electron-electron t- 

channel exchange. (b) Electron- 
electron u -channel exchange. 


(38.25) 
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4 This may be compared to the version 
in Chapter 20, which only differs in 
normalization and the absence of spin 
conserving Kronecker deltas. Here, as 
before, the Yukawa potential is attrac¬ 
tive. In the next chapter we will re¬ 
cover the expected result that the elec¬ 
tromagnetic interaction scattering like 
charges is repulsive. 


5 We write^= 7 = ^(dp+iqAy). 


6 It is also worth noting that the elec¬ 
tromagnetic field tensor may be 

generated by evaluating the commuta¬ 
tor [D^,D U \ = iqF^ u , as examined in 
Exercise 46.3. 


and, for spinors, that u s (p) 



The spinor products become 


u s ( p)u s (p ) = 2 m e £ s 7* = 2m e S 3 i s , 


(38.26) 


showing that the scattering can’t change the spin. Putting these contributions to¬ 
gether yields a scattering amplitude 4 


i M = 


4i g 2 rnf 


(p - P 1 ) 2 + m ? 


(38.27) 


We will see more examples of scattering in the next two chapters. 


38.4 Local symmetry and a gauge theory 
for fermions 

Our final job in this chapter will be to conjure up a gauge theory for 
fermions. Why do this? We know that electrons interact with elec¬ 
tromagnetic fields (since they carry electric charge), and so we there¬ 
fore plan to extend the gauge field Lagrangian of electromagnetism 
£ = —\F I1V F ,J;V to include fermions and an interaction term telling 
us how photons and electrons interact. The gauge principle that we 
used in Chapter 14 allows us to do this since the act of promoting the 
Dirac Lagrangian to a gauge theory by making it locally (7(1) symmetric 
will provide us with an interaction term through the minimal coupling 
prescription. 

As we’ve seen, the Dirac Lagrangian is invariant with respect to 
global (7(1) transformations, which is to say that the internal trans¬ 
formation ip —> ipe la leaves the Lagrangian invariant. The result of this 
is a conserved number current Jn c - To make a gauge theory we will 
promote the global (7(1) symmetry to a local symmetry exactly as we 
have done before in Chapter 14. We simply replace our transformation 
with ip —> e la ( x -* and introduce a gauge field via a covariant derivative 
Dfj, = dfj, + i qA^{x). In order for the gauge field to ensure a local sym¬ 
metry, it must itself transform as A^(x) —> A^(x) — 7<9 M a(x). 

The Dirac Lagrangian, fixed up to have a local (7(1) invariance, is 
then given by 5 

£ = Pp(iEf-m)ip. (38.28) 

Inserting our covariant derivative, we have £ = ip (iy^ [d^ + i qA^\ — m) ip 
or 

£ = Pp (i0 — m) ip — qPp/Aip- (38.29) 

Lo and behold, we are told how the gauge field A^(x) interacts with 
our fermion fields ip(x) and ip(x): it’s via the interaction term £\ = 
—qipj^A^ip. This method of determining the form of the interaction 
merely by considering the consequences of the substitution — 

qAn is another example of the minimal coupling prescription we saw in 
Chapter 14. 6 
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In classical electromagnetism the conserved currents are the 
sources of the electromagnetic fields. These fields interact with the elec¬ 
tric charges q 1 telling them how to move. This is encoded in an inter¬ 
action between the electromagnetic current density Jg m (x) and the field 
A^{x) giving a contribution to the Lagrangian C\ = —J^ m (x)A IJ ,(x), 
where is a conserved current obeying d^J£ m = 0. Our minimal 
coupling prescription motivates us to make the step of identifying the 
fermion Noether current J^ c (x) = ip^ip with the electromagnetic cur¬ 
rent via Jg . n = qJ ^ C ) where q is the electromagnetic charge. This con¬ 
strains the source currents that may be coupled to the electromagnetic 
field and guarantees that the source current is a conserved quantity. 
This philosophy is illustrated in Fig. 38.4. 

We end this chapter with a landmark equation. We will write down 
perhaps the most successful theory in modern physics: Quantum elec¬ 
trodynamics or QED. The QED Lagrangian includes the contributions 
from the gauge field of electromagnetism (eqn 5.49) and from the lo¬ 
cally gauge invariant Dirac Lagrangian describing fermions and their 
interactions (eqn 38.28) giving 

C = - ^Ffj U F^ + $ — m)ip — qxp^A^ip. (38.30) 

In the next chapter we will examine the predictions of this theory. 



covariant derivative 
^^^ninimal coupling 

iNTEKftCTToTiTsl 


Chapter summary 


Fig. 38.4 The minimal coupling pre¬ 
scription in QED. 


• The Lagrangian that leads to the Dirac equation is C = ip(i0—m)ip 
where xp = t/dq 0 and can be quantized. The Noether current is 
J^c = 

• The fermion propagator is i/{p— rn). 

• In order to have local gauge invariance, minimal coupling tells us 
that we must add to £ a term C\- — q^'y^A^xf). 


Exercises 


(38.1) Try to canonically quantize the Dirac Lagrangian 
using commutation relations rather than anticom¬ 
mutation relations for the fields. You should get a 
minus sign in front of the b p b p term in the Hamil¬ 
tonian. Explain why this is a catastrophe. 

See Peskin and Schroeder, Chapter 3 for help. 

(38.2) Verify eqn. (38.15). 


(38.3) Show that the massless Dirac Lagrangian is invari¬ 
ant with respect to the global chiral 17(1) transfor¬ 
mation xp(x) —* e 1 “’ T xp(x). Show that the Noether 
current is J£ c (x) = l 4>(x)^”f b xp(x). 

(38.4) Calculate the free fermion propagator by inserting 
the mode expansion into the equation Go(x,y) = 
{0\Txp(x)'ip(y)\0). You will need to use the spin sum¬ 
mation rules from Chapter 36. 
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Sin-Itiro Tomonaga (1906-1979). 


Quantum electrodynamics (QED) is the quantum field theory that de¬ 
scribes the interaction of light and matter. Arguably one of the greatest 
intellectual achievements ever made, its discovery was due to several 
people but notably Tomonaga, Schwinger and Feynman, who shared 
the Nobel Prize in 1965. 

Since QED describes an interaction it will come as little surprise that 
this is not a solvable theory and we must rely on perturbation theory 
and Feynman diagrams to understand the results of experiments. In this 
chapter we take the view that the Feynman diagrams are the theory and 
will therefore concentrate on finding the propagator for the photon and 
the Feynman rules for the photon-fermion interaction. One complication 
is that electrodynamics is a gauge invariant theory so we must expend 
some effort in ensuring that our dealings with the photon preserve this 
invariance. We will find that this can be done and, on the way, calculate 
our first amplitude in QED! 


^ee eqn 24.29. Note that here we 
follow the convention, introduced in 
Chapter 17, that the free photon prop¬ 
agator is called Dou V {k). 


39.1 Quantum light and the photon 
propagator 

Our task is to find the propagator for photons. We might guess that it’s 
given by the m —> 0 limit of the propagator for the massive vector field 
we found 1 in Chapter 24, which is to say 


2 All photons that we detect actually in- 
teract with electrons in detectors such 
as the eye. They must all then, in some 
sense, be virtual! How can this be? 
We know that particles that are off- 
shell have the range over which they 
can propagate limited by the extent 
to which they’re off-shell. If we see 
photons that have travelled from dis¬ 
tant stars they have to be pretty close 
to being on-shell. We’ve seen before 
that when a particle is on-shell we hit 
the pole of the particle’s propagator. 
Therefore photons from Andromeda, 
visible on a moonless night, must be so 
close to the pole that there can’t be any 
observable effects from being off-shell. 


D^u{k) 


,. i(-5^ + k^kv/m 2 ) 

lim - ry -- 

m—>o /e z — m z + le 


(39.1) 


However there’s a problem in taking this limit: the (longitudinally pro¬ 
jecting) term k^k^/m 2 threatens to blow up for photons on the mass 
shell, which have k 2 = m 2 —> 0. We will show later that this disaster 
does not occur and that gauge invariance allows the term to 

be removed. Accepting this step for now, we are left with the photon 
propagator, given by 

Do,Ak) = (39.2) 

As usual the propagator describes the contribution from a virtual par¬ 
ticle, in this case the photon. By virtual we mean that the energy 
Ek |fe|, where k is the three-momentum of the photon. 2 
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It’s worth checking that our propagator correctly predicts the mech¬ 
anism through which photons mediate the interaction between charges. 
Let’s consider the fate of two electric charges interacting via the ex¬ 
change of a photon, as shown in Fig. 39.1. Since we’re interested in 
the photons we’ll take the charges as forming electric currents This 
has the advantage that the fermion and vertex parts of the amplitude 
are contained 3 in the current J leaving us free to concentrate on the 
physics of the photon. 

We therefore consider the amplitude 

A = (^|F) -C (39.3) 

where we’ve left out the ie, since its presence won’t be important for 
what follows. 


Example 39.1 

If we work in a frame where Id 1 = (fc°, 0, 0, fc 3 ), then the amplitude looks like 


i (Jg Jg - Ja • J b) 
(fc 0 ) 2 - (A; 3 ) 2 


(39.4) 


Current conservation implies that k^J^ = 0, so we have k° J° — k? J 3 = 0, which 
allows us to eliminate the component J 3 to yield 


A = i 


J2 4 ° 

(fc 3 ) 2 


( fc °) 2 - ( fc 3 ) 2 ' 


(39.5) 


Now for some interpretation. 4 The first term is in terms of J°, the charge density. 
If we (inverse) Fourier transform this quantity we obtain an instantaneously acting 
Coulomb potential, which is repulsive between like charges 


/ 


d 4 fc 

(^o 4 


JO J0 
-i k-x J & J b 


47t|t*| 


S(t a ~ t b ). 


(39.6) 


Don’t worry about the instantaneousness of the term. It only looks unphysically in¬ 
stantaneous because we’ve split up the propagator in a non-covariant manner. More¬ 
over, this is the term which dominates in the non-relativistic regime. This Coulomb 
interaction is of course the basis of much of condensed matter physics. 

We argued above that the photons we observe are those very close to the pole. For 
our case of photons propagating along the z- (or 3-) direction, we look at the residue 
of the second term and we see that there seem to be two sorts of photon: those that 
couple J 1 currents and those that couple J 2 currents. These are the two physical 
transverse photon polarizations. 


39.2 Feynman rules and a first QED 
process 

We now turn to the interaction of the photon with a fermion. We saw 
in the previous chapter 5 that the result is an interaction term which 
expressed in Hamiltonian form is 

Hi = 


3 n 

This assertion is justified later in the 

chapter. 



= <Pl> o 7 m ^o Jb = # 67^6 

Fig. 39.1 The exchange of a virtual 
photon. 


^ Note that in the general case, where 
kv = (fc°, k), the first term in eqn 39.5 
will be iJ^jQ/k 2 . 


Remember that here, as elsewhere in 
the book, q = Q |e|. 


(39.7) 
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= -i<n M 


l 9gv 


k 2 + i 


le 



The diagrammatic version of this interaction is shown in Fig. 39.2(a). 
This one simple vertex, whose translation in words is ‘fermions can emit 
or absorb virtual photons’, governs the entire theory. 

We now have the ingredients to state the Feynman rules for quantum 
electrodynamics: 

Feynman rules for QED 

• Use all of the rules given previously for fermions. 

• The interaction vertex contributes —i< 77 ^, where q(= Q |e|) is the 
charge [Fig. 39.2(a)]. 

• Every internal photon line contributes D 0 ^ u (k) = ie) 

[Fig. 39.2(b)], 

• Incoming external photon lines contribute a polarization vector 
Qu (?) [ Fi §- 39.2(c)]. 

• Outgoing external photon lines contribute e* A (p) [Fig. 39.2(d)]. 

To get the feel of the rules, we will examine the case of e + e~ —> /i + p _ , 
shown in the Feynman diagram in Fig. 39.3. Using the Feynman rules 
for fermions and photons, we obtain an invariant amplitude of 


Fig. 39.2 The QED Feynman rules. 


iM = v s (p')(—iQ\e\^)u s (p) 



u r (k)(-iQ\e\Y)v r \k'), (39.8) 



e e 


Fig. 39.3 A first QED process: 
e~*~e _ —> 

Our treatment here follows Peskin and 
Schroeder. 

^Remember that relativistic, left- 
handed fermions have h = — 1 while 
left-handed antifermions have h = +1. 

Y 

See Chapter 36 if in doubt. We also 
assume here that E = E p ~ E p /. 


where we’ve written the charge as Q|e|. Note that the sign of the charge 
doesn’t matter for many calculations and we will drop Q in what follows. 


Example 39.2 


We’ll get some more practice manipulating spinors in these calculations by consid¬ 
ering this process in the ultra-relativistic limit. We imagine the kinematics for the 
process to be those depicted in Fig. 39.4. Remember that in the ultra-relativistic 
limit particles in chirality eigenstates will simultaneously be in helicity eigenstates. 

We’ll start with a right-handed electron with initial momentum along -j-z. A 
relativistic, right-handed electron always has helicity h = +1 and so must have a 

spinor £ = (i) , corresponding to a physical spin-up along z. We will collide 


this electron with a left-handed positron with initial momentum along —z. A left- 
handed, highly relativistic positron! 6 has h = +1, so its spinor is given by £ = m. 


corresponding to physical spin-up along z. Taking the limit of very large momenta 7 
we have spinors for the incoming electrons given by 


u(p) 


( o \ 


V2E 


V 0 J 


v(p') 


( 

V2E 

V 


0 

0 

0 

-1 


\ 


(39.9) 


To calculate our matrix element we need to evaluate products looking like u'y^u = 
It’s therefore useful to know that 


7V = 


0 1 
1 0 


0 

0 


oV 

0 


0 


(39.10) 
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Plugging in our spinors and using this result gives us the result for the incoming 
interaction vertex that 


v{p')^u(p) = 2F(0,0,0,-1) | 



( ° ^ 


( 0 \ 

( ^ 0 \ 

0 

_ 9 rp 

i 

Vo ) 

0 

— — Ath 

i 


^ - 1 ) 


l 0 ) 


(39.11) 


So far, so good. Next we need to evaluate the outgoing part describing the muons. A 
useful insight here is that the quantity v(p')^y^u(p) can be thought of as a four-vector 
describing the spin and momenta of the incoming electron states. We will take the 
inner product with a similar vector describing the outgoing muon states. We notice 
that the muon vector u(k)^ u v(k') is almost the same four-vector as v(p')^^u(p) which 
we evaluated above for the incoming electrons. The difference is that it’s complex- 
conjugated and rotated by an angle 0 in the x-z plane. The complex conjugation is 
made easy with the identity 8 [u(p)^^u(k)]* = u(k)^y^u(p) and we certainly know how 
to rotate a four-vector. Conjugating and rotating the result in eqn 39.11 therefore 
gives us 



Fig. 39.4 Kinematics 
QED process. 


^See Exercise 39.3. 


for our first 


u(k)^^v(k') = [v(k , )'y f *u(k)]* (Complex conjugation) 

= [— 2E(0, cos 0, i, — sin 01* (Rotation about y by angle 0) 

= —2E(0, cos 0, —i, — sin 0). 

(39.12) 

Putting the amplitude together via the dot product v^ug^ v u^ v v and including the 
photon propagator yields up the invariant amplitude 


4e 2 E 2 

LM = —i —(1 + cos0). 

q 2 


(39.13) 


Noting that q 2 = (2 E) 2 , we obtain the simple result that —> Mr Ml) = 

—e 2 (l + cos0). 

This can be repeated for other combinations of chirality with the result that 

M ( e R e i -*• MrM^) = M ( e L e R Ml 4) = -e 2 (l +COS0), 

M(e^e£ -* MlMr) = ^( e L e R -> Mr Ml) = ~ cos&), (39.14) 

and all other combinations yield an amplitude of zero. 



Feynman diagrams 


Fig. 39.5 Gauge invariance, the Ward 
identity and conservation of electro¬ 
magnetic current are all intimately 
linked. 


39.3 Gauge invariance in QED 

Gauge invariance is central to our notion of QED. Global (7(1) symmetry 
of the Dirac Lagrangian guarantees the conservation of fermion number. 
Local (7(1) symmetry [guaranteed by the addition of the electromagnetic 
gauge field A^x)] allows us to identify the conserved fermion current 
with the conserved electromagnetic current and fixes the form of the 
QED interaction. Gauge invariance therefore leads to the conservation 
of electromagnetic current in QED. This is certainly central to the theory 
and must be maintained in our Feynman diagrams. Gauge invariance is 
indeed guaranteed in Feynman diagrams by the Ward identity. The 
symbiotic relationship between the concepts is shown in Fig. 39.5. The 
Ward identity comes in many forms 9 and the simplified one with which 
we’ll be concerned is shown diagrannnatically in Fig. 39.6. It says that 
if a sum of diagram parts contributing to an S'-matrix element may be 
written ■■■), where p labels the vertex to which a photon 

line is attached, and that the photon line carries momentum k and the 
external lines are on mass shell, then 



ip' 

T P 


S-matrix element 


0 


Fig. 39.6 The Ward identity. Dot the 
vector k^ with the contribution to the 
5-matrix shown and you get zero. 


John Ward (1924-2000). It has been 
said that his advances were used by oth¬ 
ers ‘often without knowing it, and gen¬ 
erally without quoting him’ (M. Dun- 
hill, The Merton Record , 1995). 

^ It is a special case of the Ward- 
Takahashi identities. See Peskin and 
Schroeder for its derivation. 


k fJ M ll (k,p 1 ,p2,.:) = 0. 


(39.15) 
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This effectively kills the term k^k^/k 2 in the numerator of the photon 
propagator, since it always combines with the sort of vertex shown 
in Fig. 39.6. 

We began our discussion of the photon propagator by arguing that 
the term fc M fc„/fc 2 disappears from the numerator of our propagator. We 
conclude here that its disappearance is caused by gauge invariance and 
is encoded in Feynman diagrams by the Ward identity. A diagrammatic 
example of the Ward identity is examined in the exercises and a justifi¬ 
cation of the physics may be found in the next example. 


Fig. 39.7 The half-dumbbell 
diagram. 



Fig. 39.8 Illustration of gauge in¬ 
variance in the diagrammatic language 
used in Exercise 39.4. 


Example 39.3 

To see how gauge invariance leads to the disappearance of this term we naively 
couple a general source J ! ‘ (not the conserved current of electromagnetism) to a 
massive vector field A fl . We will calculate the amplitude for starting with no vector 
mesons and ending with one, represented by the half-dumbbell diagram shown in 
Fig 39.7. We can write the amplitude for creating a massive vector meson with any 
polarization as 

(39.16) 

A 

where the sum is over the polarizations. Assuming a basis of linear polarizations (i.e. 
so = eA/x)? this leads to a probability 

P = \A\ 2 <xJ2^(k)J^(k)e Xtl (k)e Xu (k) 

A 

= (~9^ + fc M ^/m 2 )>(fc)J I ' t (fc), (39.17) 

where we’ve used the result for e \v(k)e\ u (k) = —P^ u discussed in Chapter 13. 
Next we ask what this tells us about the source emitting a photon, by attempting 
to take a limit m —> 0. We immediately see that the k^k u /m term blows up as 
m —> 0, which is the same problem we had with trying to apply the massive vector 
propagator to photons. 

However, this won’t cause a problem if k^J^{k) = 0, since this kills the trouble¬ 
some term. Notice that this is current conservation = 0 written in momentum 

space! We conclude that the disappearance of the troublesome k^k u /m 2 term is 
due to electromagnetic current conservation, which is itself attributable to gauge 
invariance. 


Looked at in another way, the disappearance of the k^k^/kr term allows 
us to add any multiple of this quantity back into the propagator. That 
is to say, vast helpings of a term that has no effect on the physics may 
be added for your convenience. A more general version of the photon 
propagator can therefore be given by 


Dq nif{k') 


—i (ff/xi/ + (1 ~ £,)k^k v /k 2 ) 
k 2 + ie 


(39.18) 


The simple choice £ = 1 is known as Feynman gauge, while £ = 0 is 
known as Landau gauge. As examined in the exercises, exactly this prop¬ 
agator follows from fixing the gauge in which we work in the Lagrangian 
itself. The freedom to choose £ is therefore another manifestation of 
gauge invariance. 
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We have written down the Feynman rules for QED and examined a 
first process. In the next chapter we go further and look at the main (and 
measurable) application of QED calculations: calculating the scattering 
amplitudes for some fundamental processes. 


Chapter summary 

• The photon propagator has been derived and every internal photon 
line contributes a factor — ig^ v /(k 2 + ie), though alternative choices 
can be made in different gauges. 

• We have presented the Feynman rules for QED. 


Exercises 


(39.1) We will use the gauge fixing technique to find the 
QED propagator. 

(a) Show that the equations of motion for the 
electromagnetic held A fl (x) written in momentum 
space are given by 

(fcV" - fc M fc") i M (fc) = 0. (39.19) 

(b) We will attempt to find the inverse of these 

equations of motion. Explain why this must be of 
the form = C{k)g v(T + H(k)k^k (7 . 

(c) Show that defining the inverse via 

M* 11 ' r = gf demands the impossible condi¬ 

tion 

-k 2 C{k)g£ + C(k)k IJ ’k a = gf- (39.20) 

(d) We may fix this by adding a term to the La- 
grangian that ensures we work in Lorentz gauge 

= 0. We therefore add a gauge fixing term 
and obtain a Lagrangian 

C = -l F ^F^-- (39.21) 

where £ is an arbitrary parameter. The point here 
is that the gauge fixing term will force the theory 
to be well behaved, but will contain the parameter 
£ which (we claim) will not enter into any measur¬ 
able quantity. Show that the equations of motion, 
written in momentum space, are now given by 

+ k^k 1 ' - kA v {k) = 0 . (39.22) 


(e) Finally, verify that the photon propagator, de¬ 
fined as i(M _1 ) m „, is given by 

tr m • /V-( 1-0*^ /onoo , 
Do^fik) = —i I - k 2 +ie j • ( 39 - 23 ) 

(39.2) Consider the reaction e + e~ —> p, + pT again. Most 
experiments are done by firing unpolarized beams of 
electrons and positrons at each other. Since muon 
detectors are usually not sensitive to muon polar¬ 
ization we actually need to throw away the spin in¬ 
formation. What we want is an average over initial 
electron spin states and a sum over the final muon 
spin states. 

(a) Explain why this is the right thing to do and 
show that this prescription leads to a probability 

^ ^ r ’ r ')\ 2 - (39-24) 

s s' r r' 

(b) Show that we obtain a scattering proba¬ 
bility for unpolarized electrons and muons of 

I Espins \ M \* = e4 ( X + cos2 9 )• 

We will revisit this next chapter. 

(39.3) Verify [u{p)^u{k)\* = m(A;) 7 m m(p). 

(39.4) We will examine how the term fc M fc„/fc 2 is killed 
in a simple situation. Consider the Feynman dia¬ 
gram shown in Fig. 39.8(a) which contributes, via 
the S-matrix to the second-order correction to the 
amplitude for fermion-fermion scattering. 
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(a) Using the Feynman rules write down the ampli¬ 
tude for the whole process. 

(b) Consider the bold part, shown in Fig. 39.8(b). 
This features a virtual electron propagator, with a 
photon line hitting it at the end labelled v. Show 
that the amplitude _4 par t for this part of the dia¬ 
gram is 

-dpart = u(v)(-\ei ] ) f —) He7 I/ ) 

/-iflV+ / \ /onoco 

H-J U(P) ’ ( S 9 - 25 ) 


where we’ve called the photon mass m 7 and the 
fermion mass m e . 

(c) Now just consider the dangerous part, which is 
proportional to l/m^ . Show that the guts of this 
boil down to something proportional to 

■Aguts = u(p')') v j v u (p). (39.26) 

f + ft - m e 

(d) Show that this may be further simplihed to 


A _ 1 u(p'h v k^u(p) 
^ suts “ k 2 T it 


(39.27) 


Hint: rewrite the denominator as ji = + ft — 

m e ) — (fl-me) and use Dirac’s equation in the form 
fu(p ) = m e u(p). 

(e) Now turn to the process shown in Fig. 39.8(c) 
which it is also necessary to consider when calcu¬ 
lating the second-order correction to the fermion- 
fermion scattering amplitude. Just consider the 
part shown in Fig 39.8(d) (and note the change in 
labelling). 

(f) Show that the dangerous guts of this part are 
given by 


„ /, v k^ku/m^ v 

S g uts = u(p )y ~' 


p' — }k — m, 
and that this may be reduced to 

1 u(p')k lt ‘y v u(p) 


7 v u{p), 


guts - 


k 2 + ie 


(39.28) 


(39.29) 


The point, then, is that we must consider both of 
these processes and the sum of the dangerous guts 
vdguts + B g uts = 0, which means that the dangerous 
k^kv/m^ term disappears. 

This argument is discussed in more depth in Zee, 
Chapter II. 7. 



QED scattering: three 
famous cross-sections 



In this chapter we examine three famous scattering examples in QED: (1) 
Rutherford scattering, which led to the discovery of the atom; (2) Mott 
scattering, which is the relativistic version of Rutherford scattering and 
(3) Compton scattering, which demonstrates the particle-like properties 
of light. These examples will introduce a number of useful tricks which 
are employed freely in more advanced applications. 


40.1 Example 1: Rutherford scattering 
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tering 355 
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tering 357 
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We’ll examine the scattering of an electron from a heavy, charged object 
such as a point-like nucleus. This scattering of distinguishable particles 
is described by a Feynman diagram shown in Fig. 40.1(a). The nucleus 
is very heavy compared to the electron, so we treat the the process as 
involving an electron passing through a classical potential The 

Feynman rule for the vertex describing the electron interacting with 
the potential is — iQ\e\jnA^(q) [Fig. 40.1(b)], that is, the electron (with 
Q = — 1) interacts via the Fourier transform of the potential [(x). 
This is easy to understand: the interaction between an electron and 
a potential is Hi = The ip brings an electron in and 

ip takes is out, so the interaction vertex is proportional to what’s left, 
which is For a static, positive electromagnetic potential we 

have A°j(r) = and A c i(r) = 0, so the potential we are after is 

the Fourier transform of the Coulomb interaction ^^(g) = The 

amplitude for the diagram in Fig. 40.1(a) for a (Q = —1) electron to be 
scattered by a nuclear potential is given by 



i|e|7M^cl (?) 


Fig. 40.1 (a) Scattering from a clas¬ 
sical potential, (b) The Feynman rule 
for a Q = — 1 electron to scatter from 
the potential. 



Example 40.1 


The kinematics of the scattering is shown in Fig. 40.2. Calculating the transferred 
momentum gives us 1 q 2 = — \q\ 2 = —4 p 2 sin 2 In this example we will work in the 
non-relativistic limit, for which we have that u(p')j°u(p) = 2 = 2m. Putting 
everything together we obtain an amplitude 


\M\ 2 


4 Z 2 e 4 m 2 
16p 4 sin 4 (6>/2)' 


(40.2) 


Fig. 40.2 Kinematics for Rutherford 
scattering. 

1 See Exercise 40.1. 


Recall that in the non-relativistic limit 


we have u(p) 


s 

£ 
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2 See, for example, Peskin and 

Schroeder, Chapters 4 and 5. 


Finally, we can relate our amplitude to something measurable, 
a potential, the differential cross-section 2 is given by 

da _ \M\ 2 

df2 (4-7 t) 2 ’ 


For scattering from 
(40.3) 


and so plugging in our expression for |A4| 2 , we obtain 

dcr Z 2 a 2 

df2 4m 2 v 4 sin 4 (0/2) ’ 


(40.4) 


An alleged scientific discovery has no which is Ernest Rutherford’s celebrated result. 

merit unless it can be explained to a | 

barmaid. 

Ernest Rutherford (1871-1937) 


40.2 Example 2: Spin sums and the Mott 
formula 


We collect here some useful tricks for 
evaluating expressions in QED. 

Spinor trick 1: 

[«(/)r«(j)]* = [«(i)7°r t 7°«(/)] 

Spinor trick 2: 

^ u s ( p)u s ( p ) = jf + m 

s 

Trace trick 0: 

Tr (/) = 4 
Trace trick 1: 

Tr(odd number of 7 matrices) = 0 

Trace trick 2: 

Tr(7'V) = 43^ 

Trace trick 3: 

Tr(7 M 7 l/ 7 p 7 CT ) 

= AisTsT - + 9^9 Up ) 

Trace trick 4: 

7 = - 2 / 

Photon trick: 

E e »(p) e t(p) ->■ - 9 ^ 

polarizations 

See Peskin and Schroeder, Chapter 5, 
for the origin of the photon trick. 

Q 

Manipulations used in eqn 40.7 are: 
Line 1: u s (p)pu s (p) v = ty+m\ f3r) . 

Line 2: Rearranging and using spinor 
trick 2 on primed variables. 

Line 3: Identifying a trace. 


If we work through the previous example again, but this time without 
invoking the non-relativistic approximation, we obtain the relativisti- 
cally correct version of Rutherford’s result, which is known as the Mott 
formula. The complication with working with relativistic particles will 
be the manipulation of spinors. 


Example 40.2 

The scattering amplitude is still given by iA4 = i^j§ -u s (p')j°u s (p), but here we’ve 
included the spin indices. We could work out the spinor part by brute force. How¬ 
ever, this isn’t usually necessary in real life. Experiments involve scattering initially 
unpolarized fermions and detecting the products of the scattering in detectors which 
aren’t sensitive to polarization. We therefore average over the initial spin state and 
sum over the final spin states. We therefore want | ^2 s r s |A4| 2 , whose spinor part 
is given by 

(p')7°w s (p )| 2 = [“ S (p')7°« s (p)] [« s (p')7°« s (p)] • (40.5) 

s' ,s s',s 

The calculations of these spin sums are quite straightforward, but do rely on the 
knowledge of a number of tricks (see sidenote). We will draw freely from them 
throughout this chapter. 

Using spinor trick 1, we have \u(f)ru(i)\ 2 = [u(f)y°u(i)] [«(i)7°«(/)] . Using 
this, and writing out indices, our spinor term becomes 

i y M s '(p , )a7° / 3« S (P)/3M S (P) I J7°A US, (P')A, (40.6) 

s,s' 

where the explicit inclusion of subscripts allows us to rearrange the elements behind 
the sum sign (and we assume a sum over repeated indices, as usual). Next we use 
spinor trick 2 which says [u s (p)u s (p)] a p = This is used to perform 

the sum over the spin polarizations s and s' as follows 3 

§E S , S ' \u s '{p')l°u s (p)\ 2 = 5E s 'W s, (p')a7° /3 [/+H/3 I7 7 °a u< ’ , (p')a 

= bli3ty+ m \i3r,l°\ty l + m ]\ a ( 40 - 7 ) 

= ^Tr [ 7 °(/ + m )7 0 (/ / + m )] • 

To recap, we’ve boiled down the sum over spinors into the trace of a number of 
matrices. 
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Next we note that the vector j/t = 7 and we use some fun properties of the 
traces of products of 7 matrices. Trace trick 1 says that traces of odd numbers of 7 s 
vanish which means we may write: 

~Tr [ 7 °^ + m) 7 °(/ + m)] = I {Tr [' ffy°f ] + m 2 Tr [( 7 0 ) 2 ] } . (40.8) 

It’s fairly easy to work out Tr( 7 °^ 7 0 ^') by plugging in explicit forms of the matrices, 
or alternatively we can use the trace trick 3, to obtain 4 (E p / E p +p' ■ p). Trace trick 0 
gives us 4ra 2 for the last term. Remembering that the scattering is elastic, we obtain 

1 ^2\u a ' (p'h°u 3 (p)\ 2 = 2 (£ 2 + p p' +m 2 ). (40.9) 

s,s' 


Referring back to the kinematics of the scattering, it’s straightforward to show that 
2(Ep+p 2 cos 0 + m 2 ) = 4E 2 (1 —/3sin 2 9/2), which combined with \q\ 2 = Ap 2 sin 2 0/2 
allows us to complete the problem. We finally obtain 

Ze^ 

. 2n2 ■ 4 „ - I3 2 sin 2 e/2). (40.10) 

Ap z fj z sin 0/2 




We can utilize our cross-section equation once more to obtain 


dcr 

dfl 


Za 2 

4p 2 /3 2 sin 4 6/2 


(1 — 0 2 sin 2 e/2), 


which is Mott’s formula. 


(40.11) 


40.3 Example 3: Compton scattering 

Compton scattering is the process e~ +7 —> e~ + 7 . The two lowest order 
diagrams (which are second order in the interaction term) are shown in 
Fig. 40.3. 


Example 40.3 


As practice in translating Feynman diagrams, we may translate the Compton dia¬ 
grams into amplitudes. The first diagram yields (upon dropping Qs again) 

^-channel = u(p')( [ \ e h 1 'Ky ( k ') . ,, w, * -—7* A (fc)(i| e^ufy), (40.12) 

\f + ft) ~ 171 + 16 

and the second diagram gives 

i-^u-c hann el = — -j-j -- e* v (fc')(i|e|7 M )“(p)- (40.13) 

(© — ft') — m + le 

These amplitudes may be evaluated to give the Klein-Nishina formula. However, the 
full derivation of the Klein-Nishina formula is rather lengthy and can be found in 
many books , 4 so we’ll evaluate the s-channel contribution from the first diagram for 
the special case of a highly relativistic particle. (Recall that here ‘relativistic’ means 
we can ignore the mass of the electron.) We therefore have that p 2 = 0 and (as 
usual) k 2 = 0. We’re interested in \A4\ 2 averaged over initial spin states and photon 
polarizations and summed over final ones (we pick up a factor 1/4 in doing this). 
We therefore want (employing spinor trick l 5 and dropping the photon polarization 
labels): 


I _ _ 

w v — 6 _ 

4 •“ •“ . (p + k) 2 

s,s' polarizations 


V s ' (p'h v e-t(k')(f + X)e M (fc)7' i « s (p) 


x w s (p)7 <T «a( fc )(/ + fa P (k , h p u s ' (p') • 


(40.14) 


I myself am neither an experimental¬ 
ist nor a real mathematician; my the¬ 
ory stops at the Schrodinger equation. 
What I’ve done in this subject is to look 
at the evidence, do calculations on the 
back of an envelope and say to the the¬ 
oretician ‘if you apply your techniques 
to this problem, this is how it will come 
out ’ and to the experimentalists just 
the same thing. 

Nevill Mott (1905-1996) 


Arthur Compton (1892-1962) 

With his wife and two sons, Dr. Comp¬ 
ton lives in Chicago in a big brick house 
filled with souvenirs of their world 
tour. He does not know the taste of 
hard liquor, almost never smokes, al¬ 
ways offers a cigaret to women visi¬ 
tors. He plays such a bang-up game 
of tennis that he sometimes has a hard 
time finding worthy opponents. Several 
times a month he puts in an evening 
of mandolin-playing with three friends. 
When his graduate students have fin¬ 
ished an examination, he likes to dine 
them and take them to the theatre. 
Time Magazine (1936). 



Fig. 40.3 Compton scattering, (a) 
The s-channel contribution, (b) The u- 
channel contribution. 

Yoshio Nishina (1890-1951). 

^Peskin and Schroeder give, as usual, 
an especially clear derivation. 

c 

We have the identity 

7° (7 l/ 7 A 7 M )^ 7° = 7 m 7 A 7 ^ 5 an d 

so 7 ° (7 I/ e;/M7' i ) t 7° = 
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Successive lines of eqn 40.15 use the fol¬ 
lowing ideas: 

Line 1: Photon trick. 

Line 2: Contracting indices and spinor 
trick 2 . 

Line 3: Using jf 2 = p 2 = 0 and trace 
trick 4. 

Line 4: Trace trick 3. 

Line 5: u = ( p' — k) 2 = —2 p' ■ k, 
s = (p + k) 2 = 2p • k. 


Freely employing our array of tricks we have 

„2 




s -channel 


4s 2 


(9^9vpU s ' {p') 7"(/ + X)W(p) 


x u s (p) 7 CT + fi)"/ p u 3 (p')) 


4s 2 


Tr 7 " 


rTr 


4 P 2 op 2 ?; 

= - r 2 (p.fc)(p'.fc) =-. 


(40.15) 



It turns out that the second diagram contributes — and there is 
no interference term between the two. The electron spin and photon 
polarization averaged, squared amplitude for Compton scattering is then 


Fig. 40.4 Crossing symmetry. A Feyn¬ 
man diagram with an incoming par¬ 
ticle may be manipulated to describe 
a new process involving an outgoing 
antiparticle with opposite charge and 
four- moment um. 


40.4 


£ w--** (;+=)■ 

s > s ' polarizations 

Crossing symmetry 


(40.16) 



Fig. 40.5 The s-channel process 
e — e + —> /.t - 


As a final illustration of the utility of Feynman diagrams in describing 
scattering, we briefly note another very useful feature. Feynman’s inter¬ 
pretation of antiparticles as particles travelling backward in time may 
be used to manipulate Feynman diagrams, allowing us access to sev¬ 
eral more physical amplitudes from the evaluation of a single Feynman 
amplitude. 

The principle is shown in Fig. 40.4, where we see that if we start with 
a diagram with an incoming particle <f> then we may flip the leg to create 
a valid process involving an outgoing antiparticle <f> with opposite charge 
and momentum. This process, known as a crossing, has the wonderful 
property that 

+ ...)=M ...+<f>(k)), p=-k. (40.17) 



Fig. 40.6 The ^-channel process 
e~fi~ —> e ~found by crossing the 
previous diagram. 


Example 40.4 

Consider the s-channel process e~ e - *" —> g,~ examined in Chapter 39 and shown 
in Fig. 40.5. If we ‘cross’ the diagram by reversing the momentum direction of the 
incoming e - *" and outgoing /x+ (changing the sign of their charge as we do so), then 
upon flipping the diagram over we obtain Fig. 40.6 describing the process e~ g,~ —► 
e~ . The amplitude for the resulting diagram is the same as the Ai we calculated 
in Chapter 39 if we make the replacements: pa —► kA , Pb —► — &B, PC &C an d 
PY) —> — k-£>- Note that we have turned an s-channel process into a ^-channel process 
with our crossing, reflected by the fact that instead of reversing individual momentum 
we may simply replace s —> t in the amplitude. 
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Crossing symmetries are clearly a very useful feature of the formalism 
because calculating \M\ 2 for one process immediately gives us access 
to |A4| 2 for all processes related by crossings. As we have only briefly 
touched on this topic here, we recommend you consult one of the stan¬ 
dard texts before deploying crossings in anger. Peskin and Scliroeder 
and Halzen and Martin both contain several examples. 


Chapter summary 

• This chapter has illustrated three simple scattering examples from 
QED: (1) Rutherford scattering; (2) the Mott formula; (3) Comp¬ 
ton scattering. 

• Crossing symmetry makes a Feynman amplitude for one process 
describe another process. 


Exercises 


(40.1) Verify that the transferred momentum in the Use a crossing to determine the analogous equation 

Rutherford calculation is given by q 2 = 4 p 2 sin 2 §. for the process e~ —> e~ fx~. 


(40.2) 

(40.3) 

(40.4) 


Verify the Trace Tricks used in this chapter. (40.6) 

(a) Calculate Tr(‘y°jfy 0 jf) using explicit forms of the 
matrices. 

(b) Repeat the calculation using Trace Trick 3. 

Consider the e“e + —> problem from the pre¬ 

vious chapter. Using the tricks considered in this 
chapter show that 

| E l^l 2 = ^4 Tr [(/- W-e)7 M (^+me)7 1 '] 
spins 

xTr [(X + m M )m M ) 7 „j (40.18) 

(b) Taking all particles to be highly relativistic, 
show that 


\ Y i^i 2 = [(p■ • fc/ ) + (p • k ')(p • fe )] • 

spins 

(40.19) 

(c) Finally, use the kinematics discussed in Chap¬ 
ter 39 to show | Jospins |A1| 2 = e 4 (l + cos2 $)• 

See Peskin and Schroeder, Chapter 5 for help. 


Mpller scattering is the process e~e~ —> e~e~. 

(a) Two diagrams contribute to the Feynman am¬ 
plitude for this process. Identify these and show 
that 

e 2 _ 

M = — — u(k)'y tl u(p)u(k')'y IJ u(p') 

p 2 

-|-u(fc , )7 M M(p)M(fc)7uw(p , )(40.21) 

u 

(b) Show further that, in the ultra-relativistic limit, 

\ Y \ M \ 2 = y {^ Tr Tr 

spins 

+ Tr ^ / 7 M ^7 I "j Tr \]l 7 M _/7„] 
“Tr 7^] 

- ^Tr ~lv\ | • 


(40.5) Show that eqn 40.19, which describes the process 
e“e + —> fi~ fi + , may be written in terms of Man¬ 
delstam variables as 


i £ i^i 2 

spins 



(40.20) 


(c) Use a crossing symmetry to turn the previous 
equation into one describing the process e“e + —> 
e _ e + , which is known as Bhabha scattering. 
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Fig. 41.1 Feynman diagrams for 
renormalized QED. 


His laboratory is his ball-point pen. 

Caption to a photograph of Julian Schwinger (1918-1994) 
(holding a pen) 
a 
2tt 

Engraved on Julian Schwinger’s tombstone 

Some really stunning consequences of QED are seen when we examine 
processes that contain electron-positron loops. The integrals we en¬ 
counter when considering these processes contain divergences that we 
must tame with renormalization. In this chapter we will demonstrate 
two of the great results that follow from renormalizing QED. These are 
(1) the fact that the electronic charge is not a constant and (2) the fact 
that the g -factor of the electron is not quite g = 2. 

To renormalize QED we calculate three Green’s functions: 

(a) Electron 1PI self-energy —iE(^), 

(b) Photon 1PI self-energy ifl'-'Qcj), (41.1) 

(c) Interaction vertex function — i(3|e|r M (p,2/). 

These functions, each made of an infinite sum of Feynman diagrams, 
are themselves shown diagrammatically in Fig. 41.1(a-c). In calcu¬ 
lating each of these Green’s functions we will encounter contributions 
which lead to divergences. These divergences are fixed through the in¬ 
troduction of counterterms. QED is a renormalizable theory and only 
three counterterm diagrams, each corresponding to one of the Green’s 
functions above, are required. The QED counterterms are shown in 
Fig. 41.1(d f). They have Feynman rules: 


(d) Electron self-energy (Bird on a wire) i (flB + A), 

(e) Photon self-energy (Bird on a wave) \{g^ v q 2 — q fJ ’q v )C, 

(f) Interaction vertex (Electrocuted bird) iQ^^D. 

(4T2) 

In order to renormalize the theory we impose conditions on each of 
the Green’s functions which ensure that we are expanding in the correct 
masses and coupling constants. For QED the renormalization conditions 


—iE (p = in) = 0 fixes electron mass to to, 

if l^(q = 0) = 0 fixes photon mass to 0, (41-3) 

—iQlelf^p' — p = 0) = iQlejy^ 1 fixes charge to Q\e\. 


are: 
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We will begin our tour of the consequences of renormalizing QED by ex¬ 
amining the case of the photon propagator and show how renormalizing 
it explains the dielectric properties of the vacuum. 


41.1 Renormalizing the photon 

propagator: dielectric vacuum 


A dielectric is an insulator which may be polarized with an applied 
electric field. This is encoded in the dielectric constant of a material e, 
which tells us how much the dielectric alters the electrostatic potential 
of a charge. Renormalized QED predicts that the interacting vacuum 
is a dielectric! The dielectric properties of the vacuum arise because 
interactions dress the bare photon in a cloud of electron-positron pairs. 
To see how this works we will examine the photon’s self-energy. As in 
Chapter 33 we define the 1PI photon self-energy which is given by 


in^(9) = E( 


All 1PI diagrams that can be 
inserted between photon lines 


(41.4) 


(This differs in sign from the 1PI self-energy Green’s functions encoun¬ 
tered previously.) Some examples of contributions to iII Ally (g) are shown 
in Fig. 41.2. Notice that these involve the photon removing electron- 
positron pairs from the vacuum in various ways and then returning them. 

Looked at in terms of pictures, it’s clear how the photon self-energy 
will renormalize the photon propagator. As shown in Fig. 41.3, we sim¬ 
ply sum the contribution of the self-energy to infinity. The version in 
equations is made a little trickier by the presence of the indices, which 
reflect the vector-like nature of the photon field. 1 


Example 41.1 

In equations the sum is 

£>nv(q) = Do»v(q) + D Qtl \(q)[\fl Xr] (q)\D 0r] „{q) + ... 



(41.5) 

Summing the series we obtain 

D^q) =-^-+ ^n m {q)DUq), (41-6) 

q J <r 

or 

(q 2 9m — n^77 (q^j £^u(q) = —i(4i-7) 
Things are simplified rather by the use of the Ward identity, which guarantees that 
q u I\.^ u {q) = 0. As a result, we can write the photon self-energy as 

^ u (q) = (q 2 9^-q»q iy Mq)- (41-8) 

Substitution of this new form yields 



Fig. 41.2 Some contributions to the 
Green’s function iU^ u (q). 


1 Remember that the massless nature of 
the photon field means that photons are 
not vector particles. 


(a) 



Fig. 41.3 Renormalization of the pho¬ 
ton propagator. 


q 2 [ 1 - n (q)]D^(q) + fl(q)q l _ l q v D^(q) = -i 


(41.9) 
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whose solution is 


D»v{q) = 


-i 9r» 




(41.10) 


q 2 [l — 11(g)] g 4 [l-ri(g)] 

In practical calculations the q^q^ part never contributes to any observable amplitude. 
This is because D flu (q) is always coupled to conserved currents and we know that 
qiiJem = 0 . 


The resulting renormalized photon propagator is given by 


D^{q) 


q 2 [ i - n(g)] 


(41.11) 


Note that we would expect that 11(g), which contains loops, will diverge 
unless we use counterterms in our calculation of II (g 2 ). 



Fig. 41.4 The lowest order correction 
to the photon propagator and the coun¬ 
terterm that prevents its divergence. 


Example 41.2 


Let’s see how to use the counterterm. Instead of summing over all 1PI inser¬ 
tions, we’ll limit our calculation at a single electron-positron loop, which we’ll call 
if f^ u (q) = i(g fJ ' 1 'q 2 — q^q u )^(q)- To prevent the loop causing any trouble we include 
a second-order counterterm which makes a contribution i(g^ u q 2 — q (J, q L ')C ^. The 
total insertion in the photon line is shown in Fig. 41.4. Summing this insertion to all 
orders, we obtain 


A«>-(«) = 


- 1 9 a-' 


(41.12) 


g 2 { l- [jr(«) + CW] }’ 

The renormalization condition is that the photon self-energy 11(g) vanishes at q 2 = 0, 
implying that tr(q = 0) = —C 42 - 1 and the propagator becomes 


A w(q) 


__ 

q 2 {1 - [*(<?) - 7f(0)]}' 


(41.13) 


This will be enough to remove the divergences from the calculation of D /JiU (q) to 
second order. 


o 

Using the shorthand that eg is the 
unrenormalized charge and assuming 
that the counterterm contribution is ac¬ 
counted for by 11(0). 


The bare photon propagates while tearing electron-positron pairs from 
the vacuum. These processes have the effect of modifying the photon’s 
amplitude for propagation between two points. More than that, we 
know that virtual photons (and remember that all photons are, to some 
extent, virtual photons) couple to a fermion line at both ends of their 
trajectory. If we write the vertex Feynman rule as — iQ|eo|7 M , we can 
bundle up the factors of |eo| with the photon propagator, so that we 
have 2 (considering Q = 1 for simplicity): 


Du V (q) = 


ie 0 g^ 


g 2 {l- [n(g)-n(0)]} 


-i 9m 

r,2 


1 - 


U(q) - f[(0) 


(41.14) 

Written the second way, the term in the brackets may be considered the 
momentum-dependent electric charge e(q). We could say that the real- 
life electric charge, renormalized by the interactions with the vacuum, is 
given by |e| = |eo|/ jl — n(g) — f[(0) j. The electric charge therefore 



41.1 Renormalizing the photon propagator: dielectric vacuum 363 


depends on the momentum transferred by the virtual photon that me¬ 
diates the electromagnetic force! A physical explanation for this feature 
of the dielectric properties of the vacuum is provided by the concept of 
screening, as we discuss below. 


Example 41.3 

In what follows we will require an expression for the one-loop amplitude in Fig. 41.5, 
which may be translated from diagram to equation, giving 

= (—1)(—ie 0 ) 2 [ 7X-wj Tr (''t' 1 -,— —7" 7-) ■ (41.15) 

J (27r) 4 V f - m f + fl—rn) 

This integral can be done. With the inclusion of the counterterm it is found that 
7r (q) — 7r(0), the one-loop contribution to n(g), is given by 

H q ) - m = -^£ dx{x - x 2 )m ( m2 _™~_ x2)q2 ) ■ (41.16) 


We’ve seen that the renormalized propagator is given, to second order 
in the interaction, by 

D (a) = ~ 19 ^ _ — _ 

MQ) Q 2 {1 -[*(«)-*(())]} 

« =%^[l + 7r(g)-7r(0)]. (41.17) 

q 2 

To see the consequence of this on electrostatics, we need the static ver¬ 
sion of the propagator, obtained by taking q 2 = — q 2 thus: 



Fig. 41.5 The photon propagator with 
a single electron-hole loop insertion. 


n(-q) - 7r(0) 


2-7T 2 


J da; (x — x 2 ) In 


m 


+ (x — x 2 )q- 


(41.18) 


Expanding this in the limit q 2 <C m gives us 


D^(q) 


' 9^4 / \ 

q 2 \ 15tt m 2 7 ’ 


(41.19) 


where a = eQ/47r. 

Recall from Chapter 20 our (Born approximation) interpretation of 
the momentum space propagator as proportional to a matrix element 
of a potential V(r) via Doo(q) oc — f cl 3 r e lg r V(r). Doing the inverse 
Fourier transform of D 00 (q) yields a potential due to the electron of 

= - {yi + ^' <3,(r) } ■ (4L20) 

The first term is Coulomb’s potential for a point charge. The second 
term is the correction due to the screening of the electron charge that 
is provided by the virtual electron hole pairs. The screening effect is 
visualized in Fig. 41.6, where we represent the pairs as effective dipoles. 
The screening is known as vacuum polarization. 



Fig. 41.6 Screening in QED. 
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3 Willis E. Lamb, Jr (1913-2008) mea¬ 
sured the energy level shift in hydro¬ 
gen in 1947. [See W. E. Lamb and R. 
C. Retherford, Phys. Rev. 72, 241 
(1947)]. Historically, this measurement 
was of immense importance in convinc¬ 
ing physicists to take corrections due to 
virtual photons seriously. Shortly after 
the announcement of the experimental 
result, Hans Bethe (1906-2005) made a 
non-relativistic calculation of the shift 
caused by the self-energy of an elec¬ 
tron in the electromagnetic field of a 
nucleus using an early form of renor¬ 
malization. The agreement with ex¬ 
periment was good and the fully rela¬ 
tivistic explanation soon followed from 
a number of physicists. See Schweber 
for the history and Weinberg, Chapter 
14, for the full calculation of the Lamb 
shift. 


Importantly, this effect can be measured! It causes a shift in the hy¬ 
drogen energy levels of A E = — x i° )2 |t/’(0)| 2 , where ip(x) is the hydrogen 
wave function (which is only nonzero at the origin for l = 0 levels). This 
effect causes a shift in the 2 Si to 2Pi transition in hydrogen of —27 MHz 
which, despite making up only a small part of the famous Lamb shift 
(of +1057 MHz), has indeed been experimentally verified. 3 The effect of 
vacuum polarization is also very important in the quantum field theory 
of metals and will be discussed further in Chapter 43. 


41.2 The renormalization group and the 
electric charge 

Using what we know about vacuum polarization along with the renor¬ 
malization group (of Chapter 34) we can work out how the electric charge 
changes, depending on the size of the momentum that the photon is car¬ 
rying. That is, how the QED coupling depends on the energy scale. To 
do this we need to calculate the /3-function, given here by /3 = /i^-, 
which tells us how the renormalized charge e depends on the choice of 
energy scale /i. 


Example 41.4 

Using our equation for the renormalized electric charge we can expand for small n(q) 
to obtain 


e 2 hd = — 


(+0) - tt(0)) 


|e| ^ |e 0 ) 

The /3-function is then given by 


: e o[! + (*(**) - 7t(0))] + 0(e 4 ), 


1 + 2 ^(^) ~ ’HQ) 


d|e| 1 .difO) 

/? = Al +7 = 2 M|eo| ^T- 


Now we use eqn 41.16 which states that 


*(m) - tt(0) = - 


+L f 1 

27T 2 Jq 


dx (x — x 2 ) In 


m 


\m 2 — (x — x 2 )/j, 2 J 


In the large energy scale limit (g m) differentiation gets us 


<MaQ 

dfi 


f 

Jo 


dx (x — x 2 ) = 




(41.21) 

(41.22) 

(41.23) 

(41.24) 

(41.25) 


Note that, to the order to which we’re working, it’s permissible to swap eo for e on 
the right-hand side in what follows. 


We obtain a /3-function 


d/i 12-7T 2 ’ 


(41.26) 


where we accentuate the + sign, since that’s the most important part. 
This shows that the electromagnetic coupling increases with increasing 
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energy scale. Another way of saying this is that it increases with de¬ 
creasing length scale. The electron appears to be more strongly charged 
as you get closer to it. This makes sense in the context of the last sec¬ 
tion, where we interpreted vacuum polarization as a screening process. 
As you get closer to the electron you penetrate the cloud of screening 
pairs and the charge seems to increase. 


41.3 Vertex corrections and the electron 
g-factor 


Other people publish to show you how to do it, but Julian 
Schwinger publishes to show you that only he can do it. 
Anon, quoted from S. Schweber, QED and, the Men who 
Made It 


The magnetic moment operator of the electron may be written 


A = 9 



S, 


(41.27) 


where Q = — 1 for the electron. The Dirac equation predicts 4 that the 
electron g-factor of the electron is exactly 2. In this section we will make 
the link between renormalized QED and the g-factor. This will lead to 
Julian Schwinger’s famous prediction that the first-order correction to 
the QED interaction vertex causes a shift from g = 2tog = 2 + a/7r. 
That is to say that the fact that electrons can emit virtual photons 
changes the form of the electron-photon interaction in a measurable 
manner. 

The Green’s function for the electron-photon vertex can be written 
—iQ|e|r A ‘(p,p / ) and its diagram is shown in Fig. 41.7. It is defined as 5 


( All amputated insertions with one \ 
incoming fermion line, one outgoing 
fermion line and one photon line. J 

(41.28) 

Some examples are shown in Fig. 41.8. The vertex function can be 
read as describing an off mass-shell photon decaying into an electron- 
positron pair. Since the photon lias 6 J p = 1 ~, there are two possible 
configurations of the pair. It can have L = 0 and S = 1, or L = 2 and 
S = 1 . This fact is reflected in the form in which we write the vertex 
function as a sum of two terms. 

F'W) = 7^1 (<Z) + ~^F 2 (q), (41.29) 

2 m 

where q^ and where = 4 [ 7 ^, 7 "]. The function Fi(q) is 

known as the Dirac form factor 7 and F 2 (q) is known as the Pauli form 
factor. For vanishing q we need only consider the first-order contribution 
to T^ = 7 m , which is the first diagram in Fig. 41.8. We read off that for 
q —> 0 we have iq (0) = 1 and F 2 { 0) = 0. 


4 See Example 36.9. 



Fig. 41.7 The vertex Green’s function 
-iQ\e\re(p, p '). 


c 

The overal sign is chosen because 
the first-order interaction is given by 
— iQ|e| 7 M so to first-order = 7 ^. 



Fig. 41.8 Contributions to the Green’s 
function —iQ|e|r^. The contribution to 
first order is simply given by the inter¬ 
action vertex —i|e| 7 M . 

^In this notation we list the total angu¬ 
lar momentum J and the intrinsic par¬ 
ity P of the photon. 

^In fact, F\ (q) may be thought of as 
a Fourier transform of the charge dis¬ 
tribution and has the property that 
Fi( 0 ) = 1 at all orders of perturbation 
theory. 
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Fig. 41.9 The vertex function coupled 
to a classical source. 


We will now sketch out the determination of the g-factor of the elec¬ 
tron in a manner intended ‘to show you how to do it’. Since the calcula¬ 
tion is quite lengthy and involved, we will break it down into a number 
of steps. 

Step I: Consider the diagram in Fig. 41.9 showing the interaction 
vertex coupled to a classical source A'Ax), giving a Feynman rule con¬ 
tribution of — iQ\e\I' IJ, A ( jf(q), where q M is the momentum of the photon. 
Coupling this to the spinor current of the incoming and outgoing elec¬ 
tron, we have an amplitude 

\M = -iQ\e\u(p')t^(p,p')u(p)A^(q ), (41.30) 

where q = p' — p and an energy momentum conserving delta function 
has been suppressed. Inserting eqn 41.29 for F M yields 

\M =-iQ\e\u{p') 7 ^Fi(g)+ ^ F 2 {q) u(p)A c J(q). (41.31) 

Step II: The next step makes the u'j^u in the first term much more 
complicated and uses an identity, known as the Gordon decomposition 
(proved in Exercise 36.7), given by 

i V /- 4 -L \rr^ Ll ' a 

u(p')^u(p) = u{p') 7 —-b ——^ u(p). (41.32) 

Zm Zm 

The reason for this step is that is eliminates pushing the spin- 
independent part into the first term and the spin-dependent part into 
the second. Substituting this into our Feynman amplitude yields 


i M 


—iQ\e\u{p') ^ u{p)A^{q)F 1 {q) 

+ [u{p')a^q v u{p)] A*(q) {F^q) + F 2 (q)} . (41.33) 


We now have an expression for the amplitude which is separated into 
the spin-independent first term and a spin-dependent second term. The 
information on the g-factor is to be found in the second term. 


Step III: Now we take the non-relativistic limit. This involves writing 



We also need to use the explicit forms for the 


components of cr M " 



and er y 


s ijk a k 0 \ 

0 e ijk a k ) ' 


(41.34) 


Using these we find that u'a 0l u = 0 and, more importantly, that 
u'a ij u = 2 m [£ n a k (\ £ ijk ■ 

Putting this all together we have, for the spin-dependent part of the 
amplitude of our diagram in the limit q —> 0, that 


Spin-dependent 

amplitude 


( 2m)|M {F,(0) + F 2 ( 0)} 

(2m)® A] B k (q) {1 + F 2 { 0)} , (41.35) 
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where we’ve assumed A cX (x) = ( 0 ,A cl (a;)) and used the fact that, in 
momentum space, the components of the magnetic field are given by 
\£ l j k q' A d Hq) = B k {q). We conclude that the spin-dependent part of 
the amplitude is proportional to Q|e|{l + F 2 (0)}(S) ■ B , where the ex¬ 
pectation value of the spin operator is ( S ) = ^cr^/ 2 . 

We now have an amplitude, so we may extract a scattering potential 
using the Born approximation. 

iA4 = — i(f\V(x)\i). (41.36) 


Since we are still using the relativistic normalization, we need to correct 
by dividing through by twice the energy of the electron that’s scattered 
(= 2m). We obtain a real-space potential 


V(x) = -^M{1 + F 2 (0)}(S) ■ B{x). (41.37) 

m 

Comparing this to the expected expression for the potential energy of a 
magnetic moment fi in a magnetic field, namely 


V(x) = -(A) • B{x) 



B(x), 


(41.38) 


we may read off that g = 2 [1 + _F 2 (0)]. 

So g is not exactly 2! There’s a correction determined by F^O). The 
correction can be calculated by evaluating the contributions to r M . 

Step IV: Let’s look at the contributions to — i<Q|e|T M , order by order. 
To first order we have the bare vertex whose contribution (for Q = — 1) 
is i|e| 7 M , so the first-order contribution to T M is 7 ^ implying .Fi(O) = 1 
and F 2 (0) = 0 as we said earlier. There are no second-order diagrams, 
but to third order the only diagram that contributes to F 2 (q) is that 
shown in Fig. 41.10. 


Example 41.5 

From the labelled diagram, shown in Fig. 41.10, we can write down the amplitude 


for the vertex as 


pM 

diagram 


7 


d 4 fc 


~ l 9vp 


-u(p')(i|e|7 !/ )- 


(2vr) 4 (p — k) 2 + ie ^ " 1 11 V'- m + i e 7 + 

(41.39) 

This is divergent, so we must also include a third-order counterterm. With the 
inclusion of the counterterm in Fig. 41.1(e) and two pages or so of Dirac algebra 8 
one can derive a result for the Pauli form factor which is 1*2(0) = e 2 /Sn 2 . 


-(i|e| 7 P )u(p). 



Fig. 41.10 The third-order diagram 
which contributes to 1 * 2 ( 0 ). 


^ See Peskin and Schroeder, which is 
particularly informative on this sub¬ 
ject. 


The result: We evaluate g = 2[1 + ^(0)] with F 2 (0) = and find 
the famous result for the shift in the g-factor: 

9 = 2 i 1 + £ + • • ■) • (4L40) 

This was also Julian Schwinger’s result and one for which he justifiably 
gained a spontaneous round of applause when he revealed, in a talk at 


Recall that in the units used here, the 
fine structure constant is given by a = 
e 2 /47r. Because a = 1/137.036... we 
have g = 2.0023... In SI units, a = 
e 2 / A'Keohc. 
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See Peskin and Schroeder, page 196, 
for a discussion of the remarkable agree¬ 
ment of QED with modern experimen¬ 
tal tests. 


the New York APS meeting in 1948, that his theory agreed precisely with 
the crucial experiments. Of course, higher order corrections to this and 
other QED processes may be evaluated and have led to QED becoming 
the most stringently tested theory in physics. 9 


Chapter summary 

• Renormalizing the photon propagator shows that the electric 
charge is given by |e| = |eo|/[l — 11(g)]. The electric charge is 
screened by virtual electron-positron pairs (known as vacuum po¬ 
larization) and this gives rise to the Lamb shift. 

• The Dirac theory predicts g = 2, but including the electron-photon 
vertex —iQ\e\Y(p,p') (for an off mass-shell photon decaying into an 

electron positron pair) yields g = 2 (l + ^ H-) which agrees 

extremely well with experiment. 


Exercises 

(41.1) Verify 

D^(q)= + iq ^ U S Q) , (41.41) 

q 2 [l — 11(g)] q*[l-H{q)]' 

is the solution to eqn 41.9. 

(41.2) (a) Verify 

^(^^(l + j^A). (41.42) 

(b) Take the inverse Fourier transform and show 
you obtain the form in the text. 

(41.3) The electron 1PI self-energy is defined as 


(41.4) The Schwinger model deals with quantum electro¬ 
dynamics in (2+l)-dimensional spacetime and pre¬ 
dicts a one-loop vacuum polarization 

• 2 

~iii/ii/(p) = ~P 9pv)- (41.44) 

7 TfT 

Show that the photon takes on a mass as a result 
and determine its size. 

(41.5) Verify eqn 41.25. 

(41.6) (a) Show that 


—i a‘ 0 

0 icr* 


-* = £ 


All 1PI insertions with incoming 
and outgoing electron lines 

(41.43) 

Draw contributions to — iE in QED up to fourth 
order in the interaction. 


and 


ijk k r. 

„ _ I s cr U 

a ' - c ijk k 


0 e 3 cr 

(b) Show that ua t3 u = cr k £,e X3k . 


(41.45) 


(41.46) 




Part X 


Some applications from the 
world of condensed matter 


Quantum field theory finds many applications in the field of condensed 
matter physics which is concerned with the properties of many-particle 
systems whose fundamental excitations can be regarded as particles. We 
have already seen that lattice vibrations can be treated as particles called 
phonons. In the case of metallic systems which contain large numbers 
of electrons, the interactions between electrons can be very important 
in determining the properties. 

• Our first condensed matter physics example is the superfluid, and 
in Chapter 42 we show how an approximation due to Bogoliubov 
allows weakly interacting Bose systems to be treated. The quasi¬ 
particles are called bogolons and the ground state of the system 
can be treated as a coherent state. The Noether current is shown 
to arise from a gradient in the phase of this coherent state. 

• Interactions between electrons in a metal are treated in Chapter 43 
and we introduce the Cooper, Hartree and Fock terms in the per¬ 
turbation expansion for the ground state energy shift. We explain 
how to model the excitations and study them using propagators 
and we introduce the random phase approximation. 

• The BCS theory of superconductivity is presented in Chapter 44 
and we show how to write down the BCS coherent state, derive 
the quasiparticles and explore the effect of broken symmetry. This 
allows us to explore the Higgs mechanism in a superconductor. 

• The fractional quantum Hall effect is introduced in Chapter 45 and 
we explain how quasiparticles carrying fractional charge emerge in 
this theory. 

A note on notation: In this part we conform to the conventions of the 
condensed matter literature by denoting the number of particles by N 
and define the number density of particles n = N/V, where V is the 
volume. For consistency we write number operators and occupation 
numbers for the state with momentum p as N p and N p respectively. 
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There is no doubt a special place in hell being reserved for 
me at this very moment for this mean trick, for the task is 
impossible. 

Robert Laughlin (1950- ), on asking his students to deduce 
superfluidity from first principles 

A gas of weakly interacting, non-relativistic bosons is a problem where 
the methods of quantum field theory allow us to understand the ground 
state, the excitations and the breaking of symmetry. Not only is this 
an illustration of many of the methods we’ve discussed; it also applies 
to one of the most fascinating phenomena in condensed matter physics: 
superfluidity. A superfluid is a state of matter where momentum flows 
without dissipation. When 4 He is cooled below 2.17 K it becomes a 
superfluid and may flow through capillaries without resistance. It also 
exhibits the fountain effect whereby a beaker of superfluid spontaneously 
empties itself. 

The dispersion of superfluid helium has been measured with inelastic 
neutron scattering and is shown in Fig. 42.1. At low momentum the 
dispersion is linear, at large momentum the energy goes as p 2 /2m as 
expected for free particles. In the middle is a large minimum. We will 
discover that our gas of weakly interacting bosons captures the low and 
high momentum behaviour of the superfluid dispersion. We will also 
find that the form of the field that we obtain on spontaneous symmetry 
breaking necessarily gives us a superflow of momentum. 
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Fig. 42.1 The measured dispersion 
of superfluid helium. Data taken from 
A.B.D. Woods and R.A. Cowley, Can. 
J. Phys. 49, 177 (1971). 


42.1 Bogoliubov’s hunting license 


The problem revolves around the Hamiltonian for non-relativistic Bose 
particles interacting with a momentum-independent potential g. The 
Hamiltonian describing this state of affairs is 


H = — d 3 xVft(x)-V4>(x) 

2 TO J 

+ | f d 3 xd 3 yft{x)ft(y)4>(y)4>(x)8 ( ' 3 \x - y) 


~ E 2m h P &p + 2V^ p ~^ k+q&kilp ' (42-1) 

p kpq 

where the second form follows from putting the system in a box and 
inserting the mode expansion <t>{x) = '/2 p d P e lpx . As usual, we call 




42.1 Bogoliubov’s hunting license 371 


the interacting ground state of this Hamiltonian fl). This is intended 
to describe a very large number of interacting particles. For a macro¬ 
scopic sample of matter in a laboratory this number might be around 
10 23 . Unfortunately, the interaction part of the Hamiltonian, given by 
Hi = ^ 'ZZkpq ®p-qflfc +q afca p , is too complicated to solve, so we need 
an approximation. Bogoliubov came up with one by thinking about the 
low-energy behaviour of the system. 1 Bogoliubov’s assumption (based 
upon Bose-Einstein condensation) is that, if the system has very little 
energy, the number of particles N 0 in the p = 0 ground state of the 
Hamiltonian will be macroscopically large. The consequence of this is 
that instead of the exact relation a p |H) = VNq\N 0 — 1) (where |AT 0 — 1) 
is a state achieved by removing one zero momentum particle from the 
ground state), we can take a p=0 |H) \/No|H). 

Bogoliubov used this as an approximation 2 to allow the replacement 
of operators a p= o and ctp =0 with the number v^o- He then broke down 
the sum over momentum states ^2 kpq Op-gdfc-fqOfcdp by considering, in 
turn, the terms in the sum in which particular momentum subscripts are 
zero, and replacing that operator with the number \/N 0 . Since terms 
with odd numbers of operators will always give zero expectation value, 
we need only consider the cases of two or four of the indices being zero. 
The method then relies on sending the momentum index of pairs, or all 
four, of the operators to zero. Then we replace the operators ao and Sq 
with y/N q. 


Nikolay Bogoliubov (1909-1982) 

1 For ‘low-energy’, you might read ‘low- 
temperature’ if you’re imagining an ex¬ 
periment in a laboratory. 


2 

In his words, this approximation pro¬ 
vides a ‘hunting license’ to search for 
new phenomena. 



Example 42.1 




We take turns setting pairs of momentum subscripts 

to zero. Replacing pairs of 

subscripts yields six terms: 




p = 0, fc = 0, 

d_ q a}jdodo - 

-> N 0 a ] _ q a\, 


p = 0, k + q = 0, 

d_qd^d ].gdo 

-*■ N 0 a' k a k , 


p = 0, p — q = 0, 

d 0 d k dfc do 

► N 0 a' k a k , 


k = 0, fc + q = 0, 

CL^jCLqCLqCLp 

> Noa^ap, 


fc = 0, p — q = 0, 

d()dq do dp i 

- N 0 a^a p , 


fc + q = 0, p — q = 0, 

d^d^d—qdq 

> Nq d — qdq. 

(42.2) 

Lastly, we replaces all four of the subscripts to get 



ajajaodo = Nq. 


(42.3) 

1 


With this approximation we obtain an approximate interaction Hamil¬ 
tonian 


Hi 


9 

2V 


Nq + 4 N 0 ^ a p n p + -No + a p a_ p ) 

p#o p^O 


(42.4) 


Now we must prepare ourselves for a shock. This seemingly quite reason¬ 
able approximation of saying that there are a large number of particles 
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3 

Note that this global transformation 
doesn’t depend on the momentum. 

' l liis result is perhaps unsurprising in 
that our ‘hunting license’ approxima¬ 
tion is to equate \N — 1) with 11). which 
shows a slightly cavalier attitude to par¬ 
ticle number. 


in the lowest momentum state has a dramatic consequence: it breaks a 
symmetry. Our original Hamiltonian is invariant under the global 17(1) 
transformation 4>{x) —> cf>(x)e ' a , or equivalently a p —> a p e la . Looking 
at Hi, we see that the terms aj,a! p and a p a_ p both change if we make 
the latter transformation. 3 We have therefore lost the 17(1) symmetry 
and its conserved quantity, which (remembering back to our discussion 
of global 17(1) symmetry) was particle number. 4 

Since we’ve now lost the conservation of particle number, we don’t 
know what value to assign to Nq. To remedy this we set the total 
number of particles N equal to the number in the p = 0 ground state, 
plus all the rest, that is: 


n = n 0 + J 2 (42.5) 

p 7 o 

The result of these manipulations is to yield an effective Hamiltonian 

H = ^(^ rn +ng]ala p + ^^2ng(ala , _ p + a p a_ p ),{4:2.6) 
p/o ' ' p7o 

where n = N/V and we’ve dropped the constant \gn 2 term. 

The next problem is that the potential term in the Hamiltonian isn’t 
diagonal. That is, it isn’t expressed in terms of number operators of the 
form b' q b q . This may spell trouble unless there’s a way to turn these non¬ 
diagonal objects into diagonal ones. That is, we want to turn objects 
like a) p a*_ p and a p d- p into number operators. 

42.2 Bogoliubov’s transformation 

The way to turn these non-diagonal bilinears into number operators is 
to transform to a new set of operators. The procedure that does this 
is a Bogoliubov transformation which defines a new set of operators 
a p and d' p via 


f b p \ / u p — v p \ f d p 

V ®-p / V ~ v p u p ) V ^-p 

and we will use these operators to diagonalize the 
quantities u p and v p obey the following rules: 

u p ~ v p = 1 > u p = u p ’ v p = v p ’ 


Hamiltonian. 


(42.7) 

The 


(42.8) 


which are cunningly designed so that the new operators d p and dj, obey 
the same commutation relations as d p and a|,: 


Op, Oq bpq, [Op, Cfq] — Up, Oq 0. (42.9) 


The physical significance of the new operators is that they describe a new 
sort of excitation: a new bosonic quasiparticle we will call a bogolon. 
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Example 42.2 

We can show how the bogolon operator diagonalizes the Hamiltonian. The Hamilto¬ 
nian can be written in matrix form as 


h = E ( 4 

p 


a—p 



(42.10) 


where e p = E— + ng. We’re going to make the transformation given in eqn 42.7 
and fix the constants u p and v p to make the resulting Hamiltonian matrix diagonal, 
which is to say 


& = ( “p & -p 

p 


D n 0 

0 D22 



(42.11) 


Assuming that we’re successful in diagonalizing, how do we get the excitation energy, 
that is the constant in front of a p a. p l The trick is to look at the diagonalized form 
which is 

H = Y, + -P22d- p o;i p j 

p 

= Y, [-^n^p^P + -^22(1 + dl p d_ p )J . (42.12) 

p 

The sum over —p’s can be re-indexed to be a sum over positive p’s and we see that 
the constant in front of dpd p is Du + D 22 , otherwise known as the trace of the 
diagonal matrix. The trace is given by 

E p = c p (up +Vp) - 2ngu p v p . (42.13) 


The condition for the off-diagonal elements to be zero is that 

2 u p v p ng 

+v% e p 

Substituting the condition allows us to eliminate u p and v p 
answer given in eqn 42.15. 


(42.14) 

and we obtain 5 the 


5 

You are invited to verify this in Exer¬ 
cise 42.1. 


The diagonalized Hamiltonian is given by 
E„ata„, where 


H = 


p 


- J p'~ x "p'~ x -p 5 



(42.15) 


The point of diagonalizing the Hamiltonian is to tell us about the quasi¬ 
particle excitations from the ground state. These are the bogolons, which 
represent the motion of a large number of the original interacting bosons. 
The operator aj, creates a single bogolon. These bogolons don’t interact 
with each other so the problem is solved. 

Turning to the bogolon dispersion, shown in Fig. 42.2, we have that 
for small momenta the dispersion is linear and looks like 

Ep= (mY \ pl (42A6) 

This linear dispersion occurs in another important problem: that of 
phonons. In a vibrating system phonon quasiparticles are collective ex¬ 
citations of a large number of atoms. The small-momentum bogolons 



Ip I 


Fig. 42.2 The dispersion predicted by 
Bogoliubov’s model. At low momen¬ 
tum the energy is linear in |p|, at large 
momentum it is quadratic. 
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^We say that the particles are ballistic. 


7 

See Feynman, Statistical Physics. 


are similar, and they may be thought of as representing the compli¬ 
cated, collective motion of a large number of bosons. Loosely speaking, 
the original bosons have a number density whose behaviour is wave¬ 
like and these waves are quantized into bogolons. At large momentum 
the dispersion becomes E p = p 2 /2m telling us that large-momentum 
bogolons behave 6 like free particles with a mass m. 

Finally we note that Bogoliubov’s treatment does not predict the mini¬ 
mum seen in experimentally in Fig. 42.1. This owes its existence in liquid 
helium to the fact that the interactions in that system are very strong, 
not weak, as considered here. The minimum corresponds to excitations 
known as rotons, which represent a back-flow of Bose particles, rather 
like the motion of particles in a smoke ring. 7 


42.3 Superfluids and fields 

The argument presented above, in terms of creation and annihilation op¬ 
erators, represents a quick and illuminating way of solving Bogoliubov’s 
problem. An alternative approach is to attack the problem using the 
Lagrangian and the machinery of spontaneous symmetry breaking that 
we developed in Chapter 26. 

We started our treatment with Bogoliubov’s approximation d p= o|S2) « 
\/]Vo|f2). Actually we know that something very similar to this relation 
holds exactly, by definition, for a coherent state. If | ST) is a coherent state, 
we have a p= o|^) = VNo e ‘ e ° |H)- Note also that if this coherent state is 
macroscopically occupied then the uncertainty in the phase A cos 6 tends 
to zero, so that the phase do is a well defined quantity. Thus Bogoliubov’s 
‘approximation’ holds exactly for a macroscopically occupied coherent 
state. 

We can also understand how the occurrence of such a state necessarily 
implies broken symmetry. In real space (and ignoring time variation for 
now) the annihilation field $(cc) = ^^ p « P e lp ' 1 has a coherent state 
eigenstate \<f>{x)) defined such that 

$(x)\(t>(x)} = sj p{x)e' e{x) \(j){x)), (42.17) 

where p(x) is a number density of particles. Substituting the mode 
expansion shows that $(a;) has the property, for the coherent ground 
state |fl), that 

(n\Hx)\n) = \a p \n)e ip - x 

V P 

= -j=Y,y^ eWpeipx ’ ( 42 - ig ) 

V p 

but since, for Q), only the p = 0 state is occupied with any sizeable 
probability, this term dominates the sum and we have 

(fi|$(x)|fl) = \Zne ie °, 


(42.19) 
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where \Jn = sjN p= o/V. An important point here is that the held <L(£e) 
has developed a ground state with a nonzero vacuum expectation value, 
which is a tell-tale sign of a broken symmetry state. In fact we take 
(fi|<E>(a:)|fi) as the order parameter of the system. We also read off from 
eqn 42.17 that in the ground state |fl) the number operator expectation 
value is \Jp(x) = ^Jn across the sample and that the state’s phase is 
9{x) = 9 q. We stress that while there is still uncertainty in the number 
n in a macroscopically occupied coherent state, the uncertainty in the 
phase Qq is zero. Thus, the breaking of symmetry involves fixing the 
phase to 9{x) = 9q across the entire system. Clearly it is this picking of 
a unique phase that breaks the 17(1) symmetry of the system. 8 

How do we end up with a Lagrangian that describes non-relativistic 
bosons which is unstable to symmetry breaking? A system containing 
a fixed number of non-interacting, non-relativistic bosons has a Hamil¬ 
tonian Hq = ( E p — p)<:P p a p , with E p = p 2 /2m and where p is the 
chemical potential. Translating back into position space fields we have a 
Lagrangian for the non-relativistic system, including chemical potential, 
given by 

C = i^do^- —+ (42.20) 

2m 

This is similar to the form we had in Chapter 12. The interaction is 
included by subtracting a term |($id>) 2 . 


Example 42.3 

The full, interacting Lagrangian is 

C = i-fddo^-— V4> f ■ V4> + p& <f> - -(-fd-L) 2 , (42.21) 

2m 2 

which, with its positive mass-like term is unstable to spontaneous symme¬ 

try breaking. The Lagrangian may be helpfully rewritten (to within an ignorable 
constant) as 

c = Lt-tdo-h - — V4>t • V4- - - (n- <J> t <f>'T , (42.22) 

2m 2 V / 

and remembering that the number density of particles is given by we interpret 
n = fi/g as the boson density of the ground state. 


The Lagrangian for non-relativistic bosons with a short ranged repulsive 
interaction is therefore given by 

C = • V$ - Un - d*^) 2 , (42.23) 

2m 2 

which is unstable to symmetry-breaking. To see the consequences of 
broken symmetry we will use the polar coordinates of Chapter 12, so 
that the field looks like $(x) = \Jp(x)e 10 ( x \ and so is described by an 
amplitude-field part p(x) and a phase-field part 9(x). Our Lagrangian 
becomes 9 


C = -pd 0 9 - — 
2m 


-(Vp) 2 +p(V9) 2 


y, n 2 

- 2 \ n -p) > 


(42.24) 


8 The breaking of symmetry is often 
represented pictorially by drawing a 
set of compass needles which depict 
6{x): a phase angle which may be dif¬ 
ferent at every point in space. The 
breaking of symmetry, and formation 
of a superfluid phase, corresponds to 
all of the needles lining up, as shown 
in Fig. 42.3(a). This picture of a super¬ 
fluid is known as the XY model, where 
the name is motivated by the freedom 
of the needles to point in the x-y plane 
in the normal state. 
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Fig. 42.3 (a) Order in the superfluid 
corresponds to a uniform phase angle 
across the entire system. This is repre¬ 
sented on a lattice here, (b) A gradient 
in the phase V^(ai) results in a current 
in the superfluid. 

9 Note that & = ^/pe~ l6 and so 
a 0 <f> = i d 0 8y/pe' e + — 1 e l6 d 0 p, 

2yjp 

resulting in 

&d 0 $ = i pdoO + -do p. 
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(c) A 



Fig. 42.4 (a) The Mexican hat poten¬ 
tial in the superfluid problem. There is 
one of these at every point x in space. 

(b) The potential viewed from above as 
contours, with a minimum at p = n. 
Each value of 9 is equivalent in energy. 

(c) Upon symmetry breaking the vac¬ 
uum state picks a unique value of 9 
(here 9 = 0) at every point across the 
system. 

1 0 Particle physicists sometimes call 
this integrating out the high-energy 
particles. 

This is a Berry phase term which re¬ 
veals some interesting physics in the 
presence of vortices. See Wen, Sec¬ 
tion 3.6. 


where we have dropped the term \dop/2 as it is a total derivative and 
should therefore give a vanishing contribution to the action since our 
fields are defined to vanish at ±oo. 

This Lagrangian has a Mexican hat potential (Fig. 42.4) which has a 
minimum at p = n and is invariant with respect to global 17(1) trans¬ 
formations of phase. That is, we can swap 9(x) —> 9{x) + a and the 
Lagrangian is the same as long as a is the same at every point in space- 
time. The possible ground states of the theory are therefore to be found 
at 4>(;r) = \Jp(x)e l9 ^ x ) = ^/ne l9 ° , where 9q is the same at every point 
in space, but may itself take any value. We break the global 17(1) sym¬ 
metry of the theory by fixing the phase of the ground state to be at 
a particular value, say 9q = 0 as shown in Fig. 42.4(c). Breaking this 
continuous 17(1) phase symmetry will result in the emergence of a Gold- 
stone mode, i.e. an excitation which costs vanishingly little energy to 
excite at low momentum. 

To find the Goldstone mode we use a method much beloved of con¬ 
densed matter physicists. We are going to remove the modes that cost 
a lot of energy. 10 This will leave behind a theory that only includes 
the low-energy excitations, which (we hope) will include the Goldstone 
modes. Staring at the form of the potential, we see that excitations that 
involve climbing the wall of the potential by changing p will cost more 
energy than those involving rolling around in the gutter (changing 9). 
We therefore examine the excitations that arise from departures from 
the ground state by considering an excited state field yj p{x) = yjn + h. 
This species of excited state just involve climbing the wall slightly from 
the ground state. We use these excited states to expand the Lagrangian, 
which yield (on dropping 11 the total time derivative — ndo9 ): 

C = —-(Vh ) 2 - 2 gnh 2 - ( 2 sfiid 0 6) h - — (V9) 2 + .. . (42.25) 

2m 2m 

To remove the energetic states we may plug a Lagrangian into a path 
integral and integrate out the variable describing these states. We will 
use this technique to remove the amplitude h from our Lagrangian. 


Example 42.4 

We’ll make use of two neat functional integral tricks here. Remember that when 
dealing with the Lagrangian in a path integral, we can make the integrate-by-parts 
substitution to the quadratic parts: 

(dn<f >) 2 — m 2 (f) 2 —> —(p (d 2 + m 2 ) </>, (42.26) 

which for our equation in terms of h requires 

-(V/i) 2 — 2 gnh 2 —> —h ( -V 2 + 2gn ) h. (42.27) 

2m \ 2m J 

Next we use the one functional integral we can do, i.e. f T)<f>e^ f = 

e — 2 / J ( lK ) J ? which allows us to make the replacement 

+ m 2 )] <j>+ J<f>^ L(:r)-—y—^J(y). 

2 2 o z + m z 


(42.28) 
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Setting (f) = — \/2 h and J = \/2n<9o0, we have 

/i f-_L V 2 + 2gn) h - (2^/nd 0 9) h -* s/nd 0 S — 4, — x/nd 0 6. (42.29) 

\ 2m J (—l/2m)V +2gn 


Our Lagrangian becomes 


C = ndnO- 


2 gn - 


2m 




(42.30) 


To obtain the low-energy, small-momentum behaviour we treat the en¬ 
ergy density term (l/2m)V 2 as small compared to the potential density 
term 2gn and we find a Lagrangian purely describing the low-energy 
physics 


C 


1 fde \ 2 

2 g \dt) 


2m v 


(42.31) 


This is merely a wave equation Lagrangian with a linear dispersion and 
a wave speed c = • We have found our Goldstone mode: the 8 field 

has no mass term, so the energy of excitations tends to zero as p —> 0. 
Our manipulations have resulted in a low-energy dispersion 

£p=(-)"H (42.32) 

just as we had before. We’ve shown through another method that break¬ 
ing the U{ 1) symmetry gives the linearly dispersing mode at low mo¬ 
mentum. 


Noether’s theorem: U( 1) internal 

symmetry for 
theory 

the low-energy 

DO = 1 

n° = id°e» 

e 9 

III = —d i 0 

u m 

DC = 0 

We = 0 

4c = -§ao* 

J Nc = —V0 


42.4 The current in a superfluid 

Let’s examine the fate of Noether’s symmetry current for the low-energy 
theory. Looking at the Lagrangian in eqn 42.31, we see that the 17(1) 

translation 9(x) —> 9(x) + a leaves the function invariant. 12 This is 12 See Chapter 12 for a reminder, 
merely the 17(1) symmetry that is spontaneously broken in the ground 
state. Applying Noether’s theorem to this symmetry we have that 
D9(x ) = 1 and 

ng = ld 0 9 , n* = z&e. (42.33) 

As a result, we have the conserved currents 

4 c (d = - k g do9(x), J Nc (x) = £V0(z). (42.34) 

This final equation tells us that gradients in the phase in a superfluid 
[as shown in Fig. 42.3(b)] result in currents. The broken symmetry 
ground state of the superfluid has a uniform phase and if we deform the 
phase we set up a current in the system along the gradient. Notice that 
the current J n c depends on the strength of the condensate (or vacuum 
expectation value) given by n. In the absence of symmetry breaking, 
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(c) 


Fig. 42.5 (a) The superfluid with a 

w = 1 phase twist. Any attempt to 
untwist the phase would tear the su¬ 
perfluid. (b) The same superfluid and 
a w = — 1 vortex propagating to the 
right, (c) The propagation of the vor¬ 
tex unwinds the twist allowing the su¬ 
perfluid to dissipate momentum. 



13 

We have considered annihilating our 
broad phase twist in Fig. 42.5(a) with 
a vortex in Fig. 42.5(b). The broadness 
of the phase twist and the localized na¬ 
ture of the vortex are irrelevant. What 
matters is their topological character, 
namely that one has w = 1 and the 
other has w = — 1 so that they annihi¬ 
late each other. 


this is zero, so it makes sense that this is indeed the current that results 
from symmetry breaking. 

What does this tell us about the dissipationless flow of momentum 
which, after all, is one of the most striking aspects of a superfluid? We 
will use a beautiful argument based on topology that demonstrates that 
a moving superfluid can’t dissipate momentum. That is, it’s a runaway 
train that can’t slow down! The superfluid’s broken symmetry ground 
state is a constant field $ = y^ne l6 °. We imagine putting our superfluid 
in a box of size L with periodic boundaries $(a: = 0) = <I>(L). Now we 
boost the superfluid so it has a momentum P. This changes the ground 
state $ —> e lPx $ = y/ne?( e ° +Px \ Notice that the boost effectively twists 
the phase of our field. The boost also twists the boundary conditions; 
instead of <f>(0) = <1>(L), they become <f>(0)e lPi = <f>(L). This is shown 
in Fig. 42.5(a). To satisfy the periodic boundary conditions, we must 
have PL = 2irw, where w is an integer (known as the winding number, 
as discussed in Chapter 29). 

The nonzero momentum state has more energy than the ground state. 
If the superfluid can slow down it will lower its energy. Now we ask, can 
we untwist the superfluid phase to remove this extra momentum? The 
answer is no, we can’t. The nonzero momentum state involves an integer 
number of 27 t rotations of the phase angle between the ends of the box. 
The angles at the ends of the box are locked to each other by the periodic 
boundary conditions, so removing the twists will lead to singularities in 
the phase field, costing enormous amounts of energy. 

The only way we could remove a twist is to create a vortex, a localized 
phase twist. Such vortices carry integer winding numbers, so creating 
a vortex which travels across the box will remove a twist as shown in 
Fig. 42.5(b). 13 The creation of a vortex is a tunnelling process with a fi¬ 
nite energy barrier. Ignoring such subtleties would lead us to believe that 
a moving superfluid is a perpetual motion machine. However, Nature 
is not so kind and allows a quantum tunnelling process which unwinds 
the condensate. As a result, the superfluid can, over a very long period 
(and even in the absence of excitations) dissipate some momentum. 

Our model of weakly interacting bosons captures the small and large 
momentum behaviour of the dispersion, so explains at least some of the 
physics of the excited superfluid. One might question whether this dis¬ 
persion is essential to the existence of the superfluid state. It is essential 
since it guarantees that the moving fluid can’t dissipate too much mo¬ 
mentum through its excitations. The reason is that a linear dispersion 
doesn’t give rise to enough states for quasiparticles to scatter into and 
dissipate momentum. To see this consider the density of states g(E) for 
a three-dimensional fluid of bosons. For the linear dispersion of a su¬ 
perfluid, this varies as g(E) oc E 2 , whereas for the quadratic dispersion 
of a normal particle, this varies as g{E) oc E^. As a result, there are 
fewer excited states available for a superfluid than a normal fluid at low 
values of energy, reducing the phase space for scattering events that will 
reduce momentum. 




Exercises 379 


Example 42.5 

It is also worth noting another beautiful argument by Landau that shows an upper 
bound on the critical velocity for superfluid flow is given by (ng/m) 2 , a quantity 
which only exists due to the presence of interactions. In the simplest version of this 
argument, we imagine the superfluid as a body of mass M moving with velocity 
v. The body creates an excitation, with momentum p and energy E p , with the 
result that the body’s velocity changes to v '. By conservation of momentum, we 
have Mv = Mv' p, from which we see that the body ends up with kinetic energy 

— = - v-p+ 2 M’ ^he P°i n t 1S that the excitation cannot be created unless 

we have M // > -(- E p , from which we conclude that 

P 2 

E p <v p~ — « v • p, (42.35) 

if M is assumed large. In other words, we have an upper bound on the superflow 
against creating excitations, given by E p /\p\ = (ng/m) 2 . So while a non-interacting 
Bose gas (which has g = 0) will undergo a transition into a Bose—Einstein condensate 
at low temperatures, such a state will not exhibit a superflow as the critical velocity 
is zero. 


Chapter summary 

• Bogoliubov’s approximation states that the number of particles in 
the p = 0 ground state becomes macroscopically large, allowing the 
replacement of operators a p= o and hp =0 with the number '/Nq. 

• The effective Hamiltonian in a non-relativistic Bose gas can be 
diagonalized using a Bogoliubov transformation, and the quasi¬ 
particles are called bogolons. 

• The low-energy excitations in a superfluid are Goldstone modes, 
the broken symmetry is the phase 9 and the superfluid current is 

Jnc = £V0(s). 


Exercises 


(42.1) Verify eqn 42.15 using the method suggested in the 
text. You may find it useful to make the substitu¬ 
tions 

Up — cosh 6 P , 

v p = sinhflp. (42.36) 


(42.2) Verify the algebra leading to eqn 42.31. 

(42.3) (a) Show that, in three dimensions, the density of 
states g(E) = dN(E)/dE for a superfluid goes as 

9(E) <xE 2 . 

(b) Show that for a Fermi gas g{E) oc E?. 
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The sentries report Zulus to the south west. Thousands of 
them. 

Colour Sergeant Bourne (Nigel Green), Zulu (1964) 

The physics of large numbers of non-relativistic particles and their in¬ 
teractions is a branch of quantum field theory known as the many-body 
problem. In this chapter we’ll examine the properties of a large number 
of non-relativistic fermions confined in a box, which is the basis of the 
description of electrons in solids. As we’ve seen, non-relativistic fermions 
scatter via the diagram shown in Fig. 43.1. The full formalism of Dirac 
spinors isn’t required to understand this problem owing to the low en¬ 
ergies involved in metals and so we can treat the particles as interacting 
via an instantaneous potential V(x — y) = e lq '( x ~ v ^V q . Rather than 
starting with propagators and path integrals, we’ll gain some intuition 
by treating the interaction with some very simple approximations. 



43.1 Mean-field theory 

We start with the Hamiltonian 


H = H 0 + V, 


where H$ = 


PLnt 


p 2 m a p a P 


and 


Fig. 43.1 The Coulomb vertex. Note 
that in this chapter the transferred mo¬ 
mentum runs from left to right. 


v = 


2 ^q“p-g“fe+<3“ fc “p> 

pkq 


(43.1) 


(43.2) 


1 It is worth noting that |0) doesn’t 
mean that we have no particles in 
the ground state. Usually in con¬ 
densed matter physics we study sys¬ 
tems with a finite density of parti¬ 
cles at zero temperature. The non¬ 
interacting ground state for a metal is 
a gas of electrons stacked up in en¬ 
ergy up to the Fermi level pp. So 
if we take spin into account then the 
ground state of a metal may be written 

|o> = ri|p|<p F a pT“pii Empt y box >- 


where the anticommuting operators a) p and a p create and destroy 
fermions respectively. We take Hq as the dominant contribution and 
treat V as a perturbation. As usual we call |0) the ground state of H 0 
with energy E 0 and Q) the ground state of the full Hamiltonian of H 
with energy E. 1 

We need a way to deal with the perturbation provided by the com¬ 
plicated two-body potential V. A good first step is to ask what the 
mean-field behaviour of the system is. Mean-field theory is a scheme 
of approximations that crops up all over physics and involves taking 
a system of many particles and asking how a single particle reacts to 
the average behaviour of all of the others. In the context of many-body 
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theory, we will interpret the mean-field approximation as a method of re¬ 
placing pairs of operators with averages, or more correctly, with vacuum 
expectation values (VEVs), and we will later link this to the conven¬ 
tional idea of mean-field theory. The mean-field correction A E to the 
non-interacting ground state energy Eq is found by taking a VEV of the 
perturbation, which is to say we approximate the expectation value of 
the troublesome four-operator term taken with the unperturbed ground 
state: 2 (0|a£ _ q a£ +q a fe ap|0). 


- [4+q a p] - N [“£-<,«*] 


it 

l k+q G 


(43.3) 


Example 43.1 

Wick’s theorem reduces the four operators in eqn (43.2) into products of contractions 
of pairs of (normally ordered) operators. 3 We therefore apply 

„ + -t - ~ TT a 7 - all paired 

a' „CLu I O'kO'V —> I I iv 
p-q k+q K v ll L contractions 

Wick’s theorem on our string of four operators yields 

“p-q a i+q afca P = N [ a ^? a fc+ q a fc a p] 

+ a p~q a l +q N fikdp] + N [ a J_,4 +g ] d k ci p 

+ N [ a l+q a fe] + N [ a p-q a p] a l+q a fe 


+ a p-q a P a l+ q a fe ^p-q^kh\, + q Clp . 


(43.4) 

Everything within the normal ordering signs is normally ordered as written, so these 
can be dropped. Mean-field theory involves replacing the contractions with averages 
taken over the ground state: (0|O|0). 


The terms with uncontracted operators represent excitations, and so 
we ignore them for now, since we’re concerned with the ground state 
properties. For the ground state energy shift (V"), we therefore only 
consider the completely contracted terms and this procedure therefore 
yields up the three terms: 


2 

This is exactly how we usually work 
out the energy shift caused by a pertur¬ 
bation in first-order perturbation the¬ 
ory, so represents nothing new. Nov¬ 
elty comes with the next step, where 
this VEV is boiled down to something 
more useful using a version of Wick’s 
theorem. 


3 

Since we’re dealing with fermion oper¬ 
ators in this chapter then we need to re¬ 
member that swapping the order of two 
operators when we bring them together 
and make a VEV, gives us a minus sign. 


4 The Cooper term will be important 
when we come to deal with supercon¬ 
ductors in the next chapter. We’ll post¬ 
pone discussing it until then and, for 
the moment, use the fact that under 
normal circumstances, the combination 
(OlaJj.aJ™|0) = 0 for any n and m, and 
so we expect no contribution from Co- 


pkq 


(®p-q®fe+q) + (dp-q®p) (dfc+q®fc) (d-p-qdfc) (®fe+q®p/ 


leads to C 0 
= Cq + Do + Fo , 


leads to D n 


leads to F n 


(43.5) 


where we’ve shortened (0|O|0) to (O) to save on clutter. We’ll call Cq 
the Cooper term, 4 D 0 is Hartree’s direct term and F 0 Fock’s exchange 
term. 

Let’s examine the Hartree term in more detail. It gives rise to a 
contribution to the ground state energy of 

Do = 2 ^(4Vp}(4+q<W- 

pkq 


Douglas Hartree (1897-1956) 
Vladimir Fock (1898-1974) 


(43.6) 



382 The many-body problem and the metal 


We know that (0|aj.a a |0) = 0 unless r = s, so we can immediately fix 
q = 0 in eqn 43.6. Recall that the V q s are Fourier components of the real 
space potential V (r). A wave with wave vector q = 0 is a constant (that 
is, it has an infinitely long wavelength), so the Hartree term evaluates 
the energy due to the constant part of the potential. Maybe this is what 
we would expect from a first-order guess! We obtain 

A) = \^y q =o{ala p ){a\a k ). (43.7) 

pk 



(b) 


f y 



Fig. 43.2 (a) The basic interaction 

diagram, (b) The double tadpole di¬ 
agram giving the Direct (or Hartree) 
contribution to the ground state energy, 
(c) The double oyster diagram giving 
the Exchange (or Fock) contribution to 
the ground state energy. 


Notice that we’ve reduced the VEVs of operator pairs to simple number 
operators and the Hartree term has become 



The mean-field treatment has a diagrammatic interpretation: we can 
take the interaction diagram and join the legs of the operators that we’re 
averaging (Fig. 43.2). This turns out to be equivalent to the more usual 
ways of getting diagrams with the .S'-matrix or path integral approach. 
The Hartree term can therefore be represented in diagrammatic form as 
the double-headed tadpole shown in Fig. 43.2(b). 

Another point to note is that for theories where there are no par¬ 
ticles in the ground state |0) the tadpole [whose head corresponds to 
(0|aj,a p |0), see Fig. 43.2(b)] gives zero. This isn’t the case for nuclear or 
electronic matter however, where we have a finite number of fermions in 
the ground state. 


Example 43.2 

Let’s see how the Hartree term looks in terms of position space fields. We examine 
the product (a p — q a p ) We have that a p = f d 3 xip(x)e~ ip ' x , which enables us to 

say i r 

= y / d3 zdV (x)y>{x l ))e'( p - q '> x e~ ilp * . (43.9) 

This in turn allows us to write the Hartree energy as 

Do = ^2 J d 3 xdVd 3 ydV V q {^ (x)ip(x')){ip^y)ip(y')) (43.10) 

p,k,q 

Xe Kp-q) x e -ip x ' e K k +q)y e ~ ik y' 

Doing the sums over p and k yields V(5^ 3 ^(* — x') and V5^\y — y') respectively, 
which eat up two of the space integrals. We’re left with 

D 0 = i f d 3 xd 3 y Fg(^ t (ab^(ab)(^ t (y)^(j/))e _11, ' (a! -S'). (43.11) 

1 q J 

Next we sum over q , which takes an inverse Fourier transform of the potential V q , 
and we obtain 

Do = i ! d 3 xd 3 yV(x-y)(ip l! (x)ip(x))('il>^(y)ii[>(y)). (43.12) 

This is just what one might have guessed classically for the energy of two charge 
distributions p(x) and p(y) interacting via a potential V(x — y). 
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Of course, decompressing the notation here 

($(x)ip(x)) = — lim (OlTWi', a:) 10), (43.13) 

x' —> X 

is related to a propagator. 5 The internal lines that form the double 5 See also eqn 31.18. 
tadpole’s two heads are just propagators, as we expect for a respectable 
Feynman diagram. 

Next we examine the Fock term. This is represented by the double 
oyster diagram in Fig. 43.2(c) which corresponds to the equation 

F o = ~\Y1 ^(«p-g afc )(4+q a p)- (43.14) 

pkq 


Example 43.3 

Again, in order to make the VEVs nonzero, this gives us a condition on q. This time 
we have that p — q = k, leading to a result 

Fq = — — V^,_fc(a^afc)(a^dp) 

pk 

= (43.15) 

pk 

which is diagonal (that is, contains only number operators) but is clearly more com¬ 
plicated than what we had for the Hartree term. This is shown diagrammatically 
as the double oyster in Fig. 43.2(c). There is a lot of physics in the exchange term 
including, for example, magnetic ordering in metals, as examined in the exercises. 

The position space version unfolds much as before, although now the momentum 
sums mix up tc’s and y’s, giving us 

F 0 = -i Jd 3 xd 3 y V(x - y){ft (y)4>(x)). (43.16) 


43.2 The Hartree—Fock ground state 
energy of a metal 


We will use our mean-field results to work out an approximate value 
of the total ground state energy of a metal. This will involve potential 
energy contributions from the Hartree term and the Fock term and this 
is therefore known as the Hartree—Fock ground state energy. 6 


Example 43.4 


Recall that we put non-interacting electrons into a box, with one spin-up and one 
spin-down electron in each momentum state. The electrons stack up in energy and the 
upper-most occupied energy level is called the Fermi level. The number of occupied 
momentum states is 


E 

IpKpf 


v[ 

J |p|<PF (^Tr ) 3 



Vp| 
67r 2 ’ 


^The other contribution is from Ho and 
reflects fact that each 2 electron has ki¬ 
netic energy E^ = E— and obeys the 
Pauli principle. Our approach will be 
to add these kinetic energies up by not¬ 
ing that the sums are carried out up 
to pf an d that states are so closely 
spaced that we may make the replace- 

me nt E, P |< PF - V ^ p|<pp 


(43.17) 
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and with two spin states per momentum level, the total number of electron states is 
N = Vp|/ 3 tt 2 . The electron density of the system n = N/V is related to the Fermi 
momentum pp via pp — (37r 2 n) 3 , and the Fermi energy is Fp = pp/(2m). The total 
energy of the box of non-interacting electrons is given by 


Wq 


2 £ 4 °> 

|P|<PF 


2V [ 

J\p\<pf ( 2 0 3 2m 


3 

5 


NEp, 


(43.18) 


and so the kinetic energy per electron Wo/N = 3 Ep. A popular unit for energy 
among theoreticians is the Rydberg (Ry), equal to h 2 /(2m e ft^) where ao is the Bohr 
radius. Writing the volume occupied per electron as 1/n = j ttF where r is the 
average distance per particle, and using the dimensionless length r s = r/ao, we can 
express the kinetic energy per electron as 


Wo 3^_, 3 / 97r\ 2//3 1 Ry ^ 2.21 Rydbergs 

N 5 5 \ 4 / rj r£ electron 


(43.19) 


Named by John Bardeen. 


Now for the potential energy. This is not something we can calculate ex¬ 
actly, but our mean-field approach tells us that to first order we should 
add the Hartree term Dq and Fock term Fq. However an immediate 
simplification is possible. Metals may be modelled as charge-neutral 
boxes full of electrons interacting via the Coulomb force. Although, 
in reality, metals contain positive ion cores and mobile electrons, this 
level of detail is not always necessary. Instead, we use a model known' 
as jellium in which the electrons are put in a box full of a homoge¬ 
nous, positively charged jelly in order to guarantee charge neutrality. 
This uniform positive charge exactly cancels the contribution from the 
Hartree term which originates from the uniform electron distribution. 
We therefore only need consider the Fock contribution to the energy 
given by F 0 = — \ J2 pk V p -k(N p }(N k }. The ingredients of the sum are 
the ground state occupation numbers 


(N P ) 


i M < pf 

0 \p\>PF, 


(43.20) 


and the Fourier transform of the interaction potential V p . 


^Here we return to SI units, since these 
are often used in the many-body liter¬ 
ature. 

^ The integral was evaluated in Chap¬ 
ter 17. 


Example 43.5 

The electrostatic potential energy between two electrons is given by 8 V(x — y) = 

-r. The Fourier transform can be most easily evaluated by working with 

47reo|£c— y\ 

the screened potential energy V(r) = (e 2 /47reo)e /|r*| and then sending A —> 0. 

Thus 9 
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43.3 Excitations in the mean-field 
approximation 

Next we examine the changes the interactions can make to the dispersion 
relations of single particles in the metal. For the moment we will return 
to the simple minded picture of a box of fermions, which does not include 
jellium’s positive background. 

The excitations of a quantum field theory are particles, which have a 
characteristic dispersion E p . To examine the excitations of a field theory 
we want to manoeuvre our Hamiltonian into the form H = E p a) p a p . 
So when we examine the fate of particles in mean-field theory we try 
to massage the two-particle interaction, described by the four operators 
in eqn 43.2, into one that describes a single particle interacting with an 
external field. The external field here is caused by the effect of all of the 
other particles, which (finally) explains the logic of naming this proce¬ 
dure a ‘mean field’ approximation. Recall from Chapter 4 that a single 
particle in an external potential is described by an operator V q a p+q a p , 
so we want to turn the tricky term a p _ q d} k+q aka p into a sum of terms 
that look like this. Actually we want to go further than this and have 
incoming and outgoing electrons residing in the same momentum state 
giving V^ F d^ p a p , where V^ F is the effective mean-field potential 
through which the particle with momentum p passes. We do this with 
a recipe: we use Wick’s theorem to generate terms containing the VEV 
of two of the four operators and treat this VEV part as an effective 
potential. 


Example 43.7 

Returning to our Wick expansion of the operator we identify the terms with one 
contraction and one pair of uncontracted operators. We obtain a sum of terms: 

D = (“l+q a fe) a p-9 a P + (“p-q a p) a i + q a fe' 

F = — ( a fc+q a p) a p^q a fe — ( a p-q a fe) a l+q a p> 

C = ( a l+q a p-q) a P a fe + ( a P a fe)4+q a p-q- (43.29) 

We will continue to ignore the Cooper terms for now. The two terms in the D sum 
are identical after a re-indexing of the sums, and the same is true of the two terms 
in the F sum. We end up with 

D = 2 ( a l+q a k> a p-q a P’ 

F = —2 (dp_ q ai < .)d k+q dp. (43.30) 


We now have a Hamiltonian that reads 

p 2 

H = '^2 — Vq{0'p-q°‘h)d‘k+ q Q“ P - 

p qpk qpk 


(43.31) 
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Fig. 43.5 (a) The tadpole, representing the first-order Hartree contribution to the 
single-particle energy, (b) The oyster giving the first-order Fock contribution to the 
single-particle energy, (c) The sum to infinity of oyster and tadpole diagrams. 


Now we’re in a position to examine the simplified version of the two- 
particle interaction. First, let’s examine the direct term: exactly as 
before the VEV part fixes q = 0, giving a number operator and the 
expression 


^direct 


^ ] Vq=o{&]z&k) UpUp 
pk 


E 


0 El(o-fcQfc) 

k 




(43.32) 


Comparing with our equation for a single particle in an external poten¬ 
tial, we see that it looks as if each of the particles is sailing through 
an effective potential = kq=o X)fc(-Nfc). This is represented by the 
tadpole diagram in Fig. 43.5(a), which is generated by joining up two of 
the four legs. Notice that there are two ways of forming this diagram, 
re-creating the factor of two we had from Wick’s expansion. 

Next we examine the exchange term 


^exchange — ^ ^ q^fc)^fc-t-o^P‘ (43.33) 

qpk 


In order to get a nonzero VEV, we just consider the terms with p = k-\-q. 
Then we obtain 


^exchange 



P 


. k 


Up hp . 


(43.34) 



388 The many-body problem and the metal 


This corresponds to a single particle interacting with an effective poten¬ 
tial V™ F = ^2 k Vp-k(Nk), which we represent as the oyster diagram in 
Fig. 43.5(b), generated by joining two opposite legs from the interaction 
vertex. 

Notice that the effective potentials we have derived are exactly the 
first-order contributions to the self-energy £ p of a particle. They are 
each represented by a part of a Feynman diagram we can fit between 
the two external legs. We can therefore write the energy of the particles 
as a sum of kinetic and self-energy parts 

Ep= + (43.35) 

where 


Sp D) = V q=0 J2 k (Nk), ^ F) = - Efc Vp- h (N h ). (43.36) 


For the case of jellium metal, we note that the Hartree self-energy S p D ^ 
is cancelled by the positive background charge (just as previously) and 
the Fock term becomes £ p ; from eqn 43.23, justifying its assignment 
in the previous section. The energy of electrons in the Hartree-Fock 
approximation is therefore given by 


E p = ?_ 

2m 


PF 

IT 



(43.37) 


which is shown in Fig. 43.4(b). It is worth stressing at this point that this 
is not what is measured in a real metal! The Hartree-Fock approxima¬ 
tion doesn’t capture the physics very well and higher order corrections 
are necessary. This requires the full machinery of quantum field theory 
including propagators and Feynman rules and it is to this we now turn. 


43.4 Electrons and holes 

So far we have treated our metal as a box of electrons. For more serious 
calculations it is useful to distinguish two sorts of excitations. This is 
because a metal comprises electron states filled up to pf and so the 
‘vacuum’ for the metal is not empty. With this in mind we define new 
operators 


13 r* 

Notice that, defined this way, elec¬ 
tron operators only act for states above 
the Fermi surface \p\ > pf while hole 
operators only act for states below the 
Fermi surface \p\ < pp. This, in turn, 
means that we describe the system as 
having electron excitations above the 
Fermi energy and hole excitations be¬ 
low the Fermi energy. 


a P = 0(\p\ -p F )c p + 0(p F - \p\)bl, 

«p = Q(\p\ ~ Pf)cI + 0(pf ~ \p\)i> p , (43.38) 

with anticommutation relations 

{cp,^} = 6^(p-q), |S p ,6t| = <5 (3) (p- q), (43.39) 

with all other anticommutators vanishing. We say that the c p operators 
describe electrons , while the b p operators describe holes . 13 



43.5 Finding the excitations with propagators 389 


Defined in this way the kinetic energy of the system is given by 


Ho = ^2 E P + X! HpclpCp - ^2 Epbpbp- ( 43 . 40 ) 

IpKpf IpI>pf IpI<pf 


The first term is the ground state energy of all of the states, filled up to 
the Fermi level. The second term accounts for electron excitations. The 
final term accounts for hole excitations. Note that, with these definitions 
the holes make a negative contribution to the energy. Although there 
is a sense in which holes are antiparticles, they are not identical with 
positrons whose contribution to the total energy is always positive. 

To perform calculations we need an electron hole field operator, which 
is given by 


^(x) = —j= Y2 [°(\P\ -Pf)c p + 9(pf - \p\P p 
v P 


( 43 . 41 ) 


43.5 Finding the excitations with 
propagators 

Our rather ad hoc mean-held procedure will not get us much further. 
Fortunately, we can use the technology of propagators to formalize the 
procedure, turning it into a perturbation series. This will have the ad¬ 
vantage of showing how the full expansion of the four-operator term 
will go. Recall that, since we are dealing with an interacting theory 
we should think of our electrons and holes as quasiparticles. In the fol¬ 
lowing example, we will recap the steps needed to make a perturbation 
expansion and re-create all of our results so far in a hash. 14 


Example 43.8 

The interaction Hamiltonian is 'Hi{x—y) = (ajJ'i/d( y)V(x—y)i/j(y)'ip(x)8(x°—y 0 ). 

This interaction gives us an S-operator 


g — g-j J d 4 a;d 4 y ^t(a;)^/>t(j/)V(a:—j/)^/>(y)T/’(a:)5(x 0 —y°) 


(43.42) 


The basic building block of the perturbation expansion is the Feynman propagator, 
which we can obtain as the Green’s function of the equation of motion. The equation 
of motion for non-relativistic electrons is the Schrodinger equation, so we want 

(^Ep — i—) Go{p) — —i<5(t), (43.43) 

2 

with E p = E— . We obtain the propagator in momentum space as 


Go(p) 


i 

E — Ep + ie 


(43.44) 


where we have added ie in the denominator to avoid integrals hitting the electron 
pole. This choice is discussed in Appendix B. 


The use of this expansion to find the 
electron-hole propagator is examined 
in Exercise 43.4. 


^See Mahan, Chapter 2, P. Coleman, 
Chapter 8, or Abrikosov, Gorkov and 
Dzyaloshinski, Chapter 2, for the full 
story. Note also that we return to rel¬ 
ativistic notation here, so that = 
(£, *), = (E,p), etc. 
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w 6 <b) v 

^ v 

Fig. 43.6 Non-propagating lines. 

^As shown in the examples below, it 
forces us to close the contour in our in¬ 
tegrals in a particular manner, guaran¬ 
teeing that we get a sensible answer. 





Fig. 43.7 Corrections to the ground 
state energy of a metal. 


With the inclusion of holes, the free propagator for electrons in a metal is given 


by 


Go ip) = 


iflflpl ~ Pf) ifl(PF - |p|) 
E — En + ie 


(43.45) 


which is a sum of electron and holes parts. This expression is similar to what we have 
for scalar particles, although with a differing sign. Since, for a particular choice of p, 
we only need one of the terms in the expression for Go(p), a much simpler approach 
that is commonly employed is to write the electron-hole free propagator as a single 
expression 


Go(p) = 


E — Ep + i<5p 

where S p = +e for p > p\ and 8 P = — e for p < pp. 


(43.46) 


Now that we have a propagator and an interaction, we’re ready to write 
down the Feynman rules for a metal. 

The Feynman rules for a metal 

• A factor Go(p) = e _q +i(5 for each internal line. 

• A factor — iV q for each interaction vertex. 

• A factor —1 for each closed fermion loop. 

• Conserve energy-momentum at the vertices. 

• Integrate over unconstrained energies and momenta with a mea- 

sureV f (£Y' 

• A convergence factor e lE0+ for non-propagating lines. 

The final point is necessary for the two diagram parts shown in Fig. 43.6 
and stems from the instantaneous nature of the interaction, which ap¬ 
pears to allow lines to propagate instantaneously. 15 We’re now ready to 
go and calculate some things with the Feynman rules. 


43.6 Ground states and excitations 

Ground state energies may be evaluated from perturbation theory by 
summing Feynman diagrams. As discussed in Exercise 43.5, this involves 
finding the amplitude <0|S|0> = e^( Connected vacuum diagrams) ; from 
which it follows that 

^ / Connected \ 

E ^ V vacuum diagrams / 

-V = —-- 4 ( 43 - 47 ) 

The lowest order vacuum diagrams are the double tadpole and oyster, 
just as we had with the mean-field approximation. The mean-field trick, 
which gave us the Hartree-Fock approximation, represents first-order 
terms in the perturbation expansion of the ground state energy. Higher 
order terms in the expansion give us diagrams such as those others shown 
in Fig. 43.7. 
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What are the particle energies? These can be accessed through the 
interacting propagator of the theory, which includes the effect of all self¬ 
energy contributions. Recall the exact result that 


G{p) 


i Z p 

E — Ep + ir p 


+ 


multiparticle 

parts 


(43.48) 


so we should be able to read off the dispersion from the position of 
the pole in the denominator. To find the full propagator we use the 
prescription 


<%>) = E 


Connected diagrams with 
two external legs 


(43.49) 


We will use Dyson’s formula to formally sum the effect of self-energy 
processes to infinity and renormalize the propagator. As before, we 
define the 1PI self-energy as 



Amputated 1PI diagrams 
with two external legs 


(43.50) 


The sum to infinity, shown for the example of Hartree-Fock theory in 
Fig. 43.5(c), yields Dyson’s equation for the propagator 


G(P) 


1 

GoCp^ + iSCp) 


i 

E — E p — S(p) + i(5 p 


(43.51) 


so the energy of the excitations is E = E p + Rc |^E (E,p) 
contributions to £(p) are shown in Fig. 43.8. 


. Some of the 



Fig. 43.8 Contributions to the electron 1PI self-energy of the metal. 


Example 43.9 

Let’s examine the renormalization of the electron lines by first-order processes. The 
only self-energy terms containing one interaction wiggle are the tadpole and oyster 
diagrams shown in Fig. 43.5(a) and (b). This is, of course, identical to the mean-field 
result and demonstrates, again, that the mean-field approximation predicts the same 
contribution as first-order perturbation theory, namely the Hartree-Fock contribu¬ 
tions. The joy of the propagator theory is that it tells us how to make sense of the 
self-energy contribution: we can use it to renormalize the propagator by summing to 
infinity. 
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Fig. 43.9 The integration in eqn 43.53 
can be carried out in the complex plane. 


In terms of propagators the diagrams correspond to 

—iS(Tadpole) p = (-1) J2 j ^(-iV q=0 )G 0 (k)e iE0+ , 

k 

-iS(Oyster) p = ^ J ^ (-iV p - k )G 0 (k)e lE0+ . (43.52) 

k 

It’s instructive to use the complex analysis of Appendix B to show the use of the 
e 1 ^ 0 factors. Let’s examine the integral 

[ G 0 (k)e iE0+ , (43.53) 

J 27T 

whose pole structure is shown in Fig. 43.9(a). If we complete the contour in the 
upper half-plane [Fig. 43.9(b)] then we pick up all of the momentum state poles for 
\k\ < PFj completing in the lower half-plane picks up the momentum state poles for 
ifci > pf- The convergence factor e l£ ' 0+ makes the choice for us! If we complete 
in the lower half-plane (where imaginary E gets very large and negative) this factor 
will diverge. On the other hand, it will vanish for a contour completed in the upper 
half-plane in the limit that the contour becomes very large. We therefore pick up the 
poles in the upper half-plane for all \k\ < pf, each contributing residue i. We find 

[ ^G 0 (k)e iE0+ (27ri)i7V fe , (43.54) 

J Z7T Z7T 

where Nk is the occupation number distribution which is unity for |fc| < pf and 0 
otherwise. We obtain expressions for our diagrams of 


—iE(Tadpole) p 

k 

(43.55) 

—iE(Oyster) p 

= £(—i V P - k )(-N k ). 

(43.56) 


k 


For the case of jellium metal, with its positive background charge, the tadpole 
contribution is cancelled and we only need consider the oyster contribution. We 
obtain for the oyster diagram 


E(Oyster) p 


e z 1 

i fc fe P Veo |p ~ fc|2 


PF 


7T 47T6Q 



L 


|fc|<p F 


d 3 k 1 

(27r) 3 |p — k \ 2 ’ 


(43.57) 


^ Close to the Fermi level, we may ex¬ 
pand the energy of an excitation as 


Ep — E~p-\- 


m 


(|p|-p F )+- • ■ 


which, for a non-interacting system, is 
given by 

e p = + —(IpI -p F ) + •■• 


from which we identify m* = 

17 

We ignore the heavy fermion materi¬ 
als where m* Ri 10 3 m. See P. Coleman, 
Chapter 16, for a discussion of these 
systems. 



which is, of course, the same result as discussed earlier in this chapter. 

Finally, the renormalization of the electron line from the oyster contribution, 
achieved with a sum to infinity, is shown in Fig. 43.10, showing that the quasiparticle 
energy is shifted to p 2 /2 m + S(Oyster) p . 


So far we have only re-created the same results as seen earlier. To see 
the motivation for going further one can calculate the effective mass for 
Hartree-Fock theory. 16 

The result of computing the mass m* corresponding to the disper¬ 
sion E p following from our expression for the Hartree-Fock energy 
(eqn 43.57) is that m* = 0 close to the Fermi energy, which is clearly 
wrong: electrons should have effective masses of order 17 m. The prob¬ 
lem here is that the Hartree-Fock theory is completely static. It treats 
an electron as if it’s propagating in the static field of all of the others. 
In reality electrons will alter their configuration dynamically resulting in 
time-dependent correlations. These may be included in the calculation 
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+ 



1 

^ — 1 _ 


Fig. 43.10 The summation of the Fock terms to infinity. 


through the inclusion of more processes in the electron self-energy as 
shown in Fig. 43.8. 

The next logical step would be to evaluate contributions to the self¬ 
energy that include two interaction wiggles, including the pair-bubble 
diagram in Fig. 43.11. However examining the amplitude for this dia¬ 
gram we find that it is divergent for small q. This looks like a problem! 
However, this divergence is avoided if we take a side-step and, rather 
than concentrating on corrections to the electron self-energy, we con¬ 
sider a correction analogous to the photon self-energy of Chapter 41. 
This will have the effect of summing up a class of diagrams that are 
similar to that shown in Fig. 43.11 which will remove the divergence 
in the electron self-energy. This summation is the subject of the next 
section. 


43.7 The random phase approximation 

For an electron gas with a high density of particles the most important 
correction to the Hartree-Fock approximation is provided by the lowest 
order correction to the wiggle in the interaction vertex. For historical 
reasons 18 this is known as the random phase approximation or RPA 
and was first formulated by David Bohm and David Pines. 19 This correc¬ 
tion to the interaction vertex has much in common with the self-energy 
correction to the photon propagator in QED and represents a change 
in the potential felt by the electrons due to the effect of shielding. The 
idea is that the interaction wiggle creates an electron-hole pair, which 
annihilates shortly afterwards. 

This state of affairs is known here, as in QED, as a polarization 
process and is shown in Fig. 43.12. It is one example of a class of 
1PI self-energy processes that can be inserted into an interaction wiggle 
whose sum is a Green’s function denoted in(g), which we define similarly 



Fig. 43.11 The pair-bubble contribu¬ 
tion to the self-energy. 

^The randomness of a phase is unim¬ 
portant in our description of the RPA. 
The reason for the name comes from 
an alternative treatment where a term 
is shown to be neglectable, 
where xi labels an electron’s position. 
The physics of this approximation is 
that if xi is distributed over a large 
range then the random phases that re¬ 
sult will tend to cancel. 

19 David Pines (1924- ) 



Fig. 43.12 A first-order polarization 
process showing a bubble in the inter¬ 
action line. 
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to its cousin in QED as 

in (q) = E 


All 1PI diagrams that can be inserted 
into an interaction wiggle 


(43.58) 


Some of the terms contributing to in(< 7 ) are shown in Fig. 43.13. 



In order to fully take account of ill(g) we need to sum the contribution 
to infinity. Since we don’t have a photon propagator in this theory, these 
renormalize the potential, changing the interaction strength from — \V q 
to the effective potential —i V e s(q). The sum to infinity is simply carried 
out in the usual manner: 

-i V e ff(q) = 


The interaction potential V q therefore becomes an effective potential 
V e g(q) = V q /{\ — VglT^gr)], which is often written V e g = V q /e(q), where 
the permittivity e(g) = 1 — V q Tl(q) is a function of energy q° and wave 
vector q. So, just as in QED, the vacuum state of the metal takes on 
dielectric properties caused by electron-hole pairs buzzing in and out of 
existence. The physics of these dielectric properties lies in the screening 
of the electrons from each other’s charge by the electron-hole pairs. 
Finally we note that since e(g) is a function of energy, the inclusion 
of polarization diagrams will provide the all-important time-dependent 
correlations whose neglect resulted in a nonsensical effective mass in the 
previous section. 

Our treatment of 11(g) has so far been quite general. In the RPA we 
evaluate the effect of only the lowest order contribution to 11(g), that is, 
the electron-hole bubble shown in Fig. 43.12, whose amplitude we will 
call If (g). (The sum to infinity of this interaction that renormalizes the 
photon line in the RPA is shown in Fig 43.14.) 


—iljj + 


il4 


i%) 


-iV a 


+ 


-iV q 

-iV a 


iH(g) 


-iV„ 


in(g) 


-iV„ 


1 - V q n(q) 


+ ... 

(43.59) 
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Fig. 43.14 The sum of bubbles in the interaction line. 


Example 43.10 

Including a minus sign for the fermion loop, the amplitude of the bubble in Fig. 43.12 
is given by 


i n(q) = — V 

= -V 

= V 


/ 

f d + r 
J (27r) 3 J_ 


d 4 q 

(2^F 

d 3 p 

(27r) 3 J_ oa 2-it p° + q° - E p+q + iS p + q p° - E p + i<5 p 
d 3 p 1 


G 0 (p + q)Go(p) 

dp 0 


I 

r°° dp 0 / l 

J-oo 2 IT Vp° + q° - Ep 


(27r) 3 (E p+q - i5 p+q ) - (E p - i5 p ) - q° 
dp 0 ( 1 1 


Ep-\-q -|- i5p-\-q p Ep -\- i<5p 


(43.60) 


where, in the final line, we have rewritten the amplitude in a form which allows 
us to do a contour integral (see Appendix B). The integrals over p° have poles at 
= — q° + Ep+q — iSp+q and p° = E p — i 5 P respectively. The residues for these 
are Fermi functions and so we pick up factors 27ri7Vp+ g and 27riTVp respectively and 
obtain the result 20 



d 3 p 1Vp+q - N p 

(27r) 3 Ep+q — E p — q° 


(43.61) 


The integrand in eqn 43.61 is known as the Lindhard function. The evaluation of 
the integral involving the Lindhard function is carried out by P. Coleman, Chapter 8. 
The result is a rather complicated expression for e(q). Analytically continuing to 
imaginary energies and doing the integral it is found that 


7f(iq 0 ,q) 


-\q{E ¥ )r 


i q° kb 

4E+ 5 2pp / 


(43.62) 


where g(E+) 


d N p 
dE P e f 


is the density of states at the Fermi level and 


v - + 1 
X 

f-i ’ 
(43.63) 

is a dynamicized version of the function F(x) that featured in the evaluation of 
the Fock self-energy. This latter expression must be analytically continued back to 
the result we want for real q° (see Mahan, Chapter 5 for the details). However, 
for our purposes we will limit our attention to the important limit q° —> 0, where 
E(iq°/4:Ep, \q\/2p+) reduces to F(\q\/2p-p). 


E(y,x) = l+- 


, i y n2 

1 — lx - 


In 


x-l + 1 


x — - — 1 


1 

4x 


i / , V' 2 

1 - [x + - 


In 


We will extract three great results from the RPA by examining three 
special cases of e(q°,q). 


20 

Dropping the infinitesimal in the de¬ 
nominator, since we won’t use it fur¬ 
ther. 
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Case I: In the static (q° —> 0) limit we have 7r(0, q) = 

— \g{E-p)F(\q\/2p F ). Particularly illuminating is the static and small 
q limit, where F(\q\/2q F ) -> 2 and lim q ^ 0 7r(0, q) = -g{E F ) = 

(where we have included a factor of 2 to account for spin degeneracy) 
and so 


lim e(0, q) 
q *0 


1 + 


/ 3?re 2 \ 
\2eo-E-F ) 


1 

W 2 


= 1 + 



(43.64) 


21 

The order of the limits is crucial here. 
Here we have started with the static 
limit and examined what happens as 
q —> 0. The other order, correspond¬ 
ing to considering a uniform field (i.e. 
q = 0) then taking the limit —> 0 
tells us about the response of a metal to 
ac fields and hence its transport prop¬ 
erties. 

F e ff(0, |g|) 



Fig. 43.15 The static effective poten- 
tial Iim|,|_ 0 V eff (0, q). 

Kff(°> M) 



Fig. 43.16 The static effective poten¬ 
tial i4ff (0, |cc|) given by the wave vector 
behaviour close to pp* 

22 

These are known as Friedel oscilla¬ 
tions after Jacques Friedel (1921- ). 

23 n 

Strictly one should consider the real 
part of Tc(q) and so this derivation 
should be regarded with caution! 


where #tf is known as the Thomas-Fermi wave vector. Notice that the 
permittivity diverges as |qr| —> 0. Perversely this is good news! It says 
that a uniform electric field can’t penetrate a metal, which is something 
we certainly expect. 21 Plugging e(0, q) into eqn 43.59 we see that the 
static interaction potential is changed to an effective potential 

linoi V e ff(0, |<?|) = ~~tt i . * 2 , (43.65) 

q^o eoV \q\ z + 

which is shown in Fig. 43.15. Notice that this is well behaved as |q| —> 0. 
We started this section by arguing that the RPA was going to tell us 
about screening. The connection with screening arises because our result 
for V e ff is the Fourier transform of the real-space interaction potential 

e 2 

lim V e s(x) (x — 6 -9tfM (43.66) 

|aj|—>oo |*| 

which is, of course, also known as the Yukawa potential, perhaps the 
simplest screened Coulomb potential. We conclude that the creation and 
annihilation of electron-hole pairs screens the electronic charge and in 
the RPA approximation this screening results in the Coulomb potential 
being replaced by an effective Yukawa potential. 

Case II: In the static limit for nonzero q the infinite slope of F(x) at 
x = 1 means that e(0, q) has a weak singularity at \q\/2p F = 1. Near this 
point 7r(0, q) a (|q| — 2pF)hi(|qr| — 2 p F ). When inserted into eqn 43.59 
and Fourier transformed this leads to an effective potential 

P eff (0,|*|)a COS(2 ^ |a:|) , (43.67) 

|*| 

shown in Fig. 43.16. This long-range, oscillatory potential reflects the 
fact that the metal as a whole reacts to the presence of each charge. The 
sharpness of the Fermi surface causes this reaction to ‘ring’ in space, seen 
as the oscillations in the effective potential of an electron. 22 

Case III: Finally we examine a case with nonzero q° in the limit 

!<?!-> o. 


Example 43.11 

In this case we may expand 23 the Lindhard function for small q: 

Np-\-q N p ^ q ■ Vp f d N p \ 

Ep-\-q Ep q® q ■ Vp q® \ d Ep J 


(43.68) 
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where v p = V p E p . Expanding in powers of momentum, and including a factor 2 
for spin degeneracy, we have 


lim 7r(g°. q) ~ — 2V 
kho 


f d 3 p 

q v p (q-Vp) 2 ' 

/diVp\ 

/ ( 2 tt ) 3 

[ q° (-7 0 ) 2 J 

Vd E p ) 


= 2V 


/ 


d|p|d(cos0)d0 

(2tt) 3 


IpI 2 


|q||Up| cos 8 /|q||u p |cos0 


(43.69) 


i-rtfdPl-PP), 

IpI 


where we’ve used the fact that y dE P J = — j^S(\p\ —pp). The integration over angle 
kills the first term and we obtain from the second that 


lim 7T (q°,q) 
Q l-^O 


g( E f ) 




(43.70) 


Inserting into our expression for the permittivity yields 


lim e(q°, q) = 1 — 
kho 


UC 


(q°y 


(43.71) 


9 2 

where u> p = is known as the plasma frequency. The fact that 
lim| q |^o e(q°, q) vanishes at q° = lu p is telling us about the presence of 
a new, long-wavelength, mode of excitation in the system. This is the 
plasma oscillation and corresponds to the entire system of electrons 
oscillating with respect to the positive background. 



Fig. 43.17 The oyster diagram within the RPA. 


Finally we mention the reason we took the diversion into the RPA in 
the first place: the correction to the excitation energy E p . This is ob¬ 
tained from the RPA by inserting the renormalized interaction shown by 
the shaded blob in Fig. 43.14 into our Feynman diagrams. For example, 
the corrected oyster diagram is shown in Fig. 43.17. The evaluation of 
the numbers is, as might be imagined, quite involved. It may be shown 
that the quasiparticle energy is predicted to be 

E„ = P - 0.166r s (ln r s + 0.203)^^ + const., (43.72) 

2m 2 to 

which leads to a nonzero effective mass. 

The insights provided by the RPA allow us to draw a diagram of the 
dispersion relation of excitations of the metal, shown in Fig. 43.18. The 


(a) 

E 



Fig. 43.18 (a) The excitation spec¬ 

trum of a metal showing low-energy 
particle-hole quasiparticle excitations 
and the higher energy (collective) plas- 
mon mode. A quasiparticle excitation 
with energy E p may be created with: 
(b) a momentum transfer |p|, from one 
side of the Fermi surface, and (c) ~ 
|p| + 2pp from the other (and all mo¬ 
menta in between) leading to the con¬ 
tinuum of states shown in the shaded 
region. 
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The many-body problem and the metal 


low-energy excitations are quasielectron-quasihole pairs, while at higher 
energies we see the plasmon mode. Immediately noticeable is the fact 
that the low-energy excitations form a continuum of width 2 pp. This 
arises because excitations involve promoting an electron from within a 
Fermi sphere which has a diameter 2pp- For a given excitation energy 
we therefore have a range 2pp of possible momenta, with the limits 
corresponding to forming the excitation from filled states on the nearest 
and furthest points on the Fermi sphere, as shown in Figs. 43.18(b) 
and (c). 


Chapter summary 

• A treatment of interactions in metals has been introduced at the 
level of the Hartree-Fock approximation. The Hartree term gives 
the energy due to the constant part of the potential and is repre¬ 
sented by a tadpole diagram. The Fock term describes exchange 
and is represented by an oyster diagram. 

• The next correction to the Hartree-Fock approximation is the 
RPA, introducing a bubble into the interaction line due to virtual 
electron-hole pairs and this allows us to treat screening. 


Exercises 


(43.1) 


Verify that , where the right- ( 43 - 3 ) Including spin leads to a many-body Hamiltonian 

J 47T 47T€o r s ’ <=> » , r 

hand side is in units of Rydbergs per electron. tor electrons ot 


(43.2) (a) Using the Hartree-Fock expression for the quasi¬ 
particle excitation energy in eqn 43.37, show that 
the Hartree-Fock effective mass is given by 


m 

m* 


e 2 m 1 /1 + x 2 
2npv x 2 \ x 


1 + x 
1 — x 



(43.73) 


with x = \p\/pf and show that this diverges at 
x = 1. 

(b) Using the RPA expression for the quasiparticle 
excitation energy in eqn 43.72 show that the RPA 
effective mass is given by 


— = 1 - 0.083r s (lnr s + 0.203). (43.74) 

m* 


2 

H = ^ (43.75) 

per 

+ 2 I>jfflp_q (T aj !+9(T /afcCT'hp< T . 

pkqc tu' 


(a) Why does the kinetic energy term favour equal 
numbers of spin-up and spin-down electrons? 

(b) Show that the Hartree contribution to the total 
energy is independent of the relative populations of 
spin-up and spin-down electrons. 

(c) Show that the Fock term gives a negative con¬ 
tribution to the energy for electrons with like spins 
and no contribution from the interaction of elec¬ 
trons with unlike spins. 

(d) Under what circumstances will this system be 
a ferromagnet? 
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(43.4) From the definition of the free electron-hole prop¬ 
agator 

G 0 (x,y) = (0\T^(x)^(y)\0), (43.76) 

along with the electron-hole mode expansion, show 
that the electron-hole propagator is given by 
eqn 43.46. 


self-energy. 

See Mattuck for help. 

(43.7) Consider an electron gas in (l+l)-dimensional 
spacetime. 

(a) Show that the equation of motion for electrons 
near the Fermi energy may be written 


(43.5) Use the fact (which was proved in Chap¬ 
ter 22 . 2 ) that (U(oo)|fl(—oo)) = (0|S'|0) = 

„ ( Connected \ 1 , 
exp > .. to show that 

\ vacuum diagrams J 


(43.77) 


^ ( Connected \ 

E ^ \ vacuum diagrams / 

~'v - —- W --• l 43 - 77 ) 

Notice that dividing by the volume and time VT re¬ 
moves the factor of (x = 0 ) that we showed in 
Exercise 19.4 accompanies all vacuum diagrams. 


(43 78) 

where the + sign is applied for electrons moving to 
the right and the — for electrons moving to the left. 

(b) What is the Lagrangian for this system of left- 
and right-moving electrons? 

(c) Using the (1 + l)-dimensional representation of 
the 7 matrices from Exercise 36.5, 7 0 = o 2 and 
7 1 = icr 1 , show that the Lagrangian may be written 


(43.6) (a) Write down the amplitude for the pair-bubble 
diagram in Fig. 43.11 and show that it varies as 
f which is divergent at small q. 

(b) By considering the other diagrams containing 
two interaction wiggles, show that the pair-bubble 
is the most divergent diagram at second order. 

(c) What is the most divergent diagram at third or¬ 
der? 

(d) Show that the RPA corresponds to a summa¬ 
tion of the most divergent diagrams in the electron 


£ = iV, t (|-u F a 3 ^)v>, (43.79) 



(d) Employing units where vf = 1, show that this 
may be written the form of a massless Dirac La¬ 
grangian. 

This problem is discussed in more depth in Zee, 
Chapter V.5. 
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I’ll use super-mathematics/ One bean weighs 1/20 of an 
ounce! The jar weighs 12 pounds! Allowing two pounds for 
the jar, that makes 20 times 16 times 10 ... or 32,000 beans/ 

Next request, please! 

Superman (1938-1992, 1993- ) 

Upon cooling, it is found that many metals undergo a transition to 
a superconducting state where all magnetic flux is expelled from their 
interior and their electrical resistance is zero. Experiments have probed 
the excitation spectra of these materials, which are found to have an 
energy gap A per particle that separates the ground state from excited, 
quasiparticle states. In this chapter we will use quantum field theory to 
describe this phase of matter. 


44.1 A model of a superconductor 


Fritz London (1900-1954) 


^By ‘breaking global symmetry’ we 
mean that the ground state picks out a 
unique direction for the phases to point 
for the entire system. 

2 

The electron-phonon interaction is 
quite similar to the electron-photon in¬ 
teraction in QED, with the interac¬ 
tion consisting of an electron emitting a 
phonon, rather than a photon as shown 
in Fig. 44.1. 



phonon 


Fig. 44.1 The electron-phonon inter¬ 
action vertex. 

Herbert Frolich (1905-1991) 


Like superfluidity, superconductivity involves a dissipationless flow of 
particles. In the late 1930s, Fritz London recognized that both effects 
involve the condensation of particles into a macroscopically occupied 
state. We saw that in the case of superfluidity this went hand in hand 
with the spontaneous breaking of a global t/ (1) symmetry. This will 
also turn out to be the case for superconductors. However, since the 
particles in a superconductor are charged (they must be since they carry 
electrical current), we are motivated to look for a theory which includes 
electromagnetism. Electrodynamics is a gauge theory which is invariant 
with respect to local U( 1) transformations; it is the global breaking of 
this symmetry 1 that leads to superconductivity and the expulsion of flux 
and measured dispersion follow as consequences: 

A superconductor is a state resulting from breaking global phase 
symmetry in a system with local U{ 1) invariance. 

Which particles should we be considering as the constituents of the su¬ 
perconducting state? Although originally it was assumed that these 
were the electrons of a metal, this is incorrect. The superconductor is 
built, not from electrons, but from pairs of electrons. In 1949 Herbert 
Frohlich formulated the theory of the electron-phonon interaction 2 and 
this allowed the possibility of an attractive interaction between electrons 
mediated by phonons. This interaction between two electrons looks like 
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a ^-channel electron-electron scattering process as shown in Fig. 44.2. 
What makes an attractive interaction a realistic proposition is the vast 
difference in time scales between the moving electrons and the move¬ 
ment of ion cores. Electrons move at the Fermi velocity and so the time 
scale over which they move through the crystal is E p 1 . The time scale 
of the deformation of the lattice is set by the phonon Debye frequency 3 
as Wp 1 . Since Eff 1 <C then the original electron has moved well 
clear by the time the phonon finds the second electron. In the field 
theory picture, this difference in time scales corresponds to an effective 
electron-electron interaction that is strongly ‘retarded’, which is to say, 
strongly dependent on the phonon frequency. 



Fig. 44.2 The ^-channel process in a 
superconductor. The effective interac¬ 
tion is mediated by a virtual phonon. 


Example 44.1 

We can piece together the form of the effective electron-electron interaction. By 
analogy with QED, the electron-phonon interaction vertex will be that shown in 
Fig. 44.1 with a coupling constant k,. The effective electron-electron interaction will 
be given by the t-channel process, mediated by the phonon, which has a propagator 
resembling the free propagator for the scalar field Do ( q ) oc (a; 2 — a; 2 ) — 1 , where u q 
is the dispersion of the phonons. This results in an effective interaction between 
electrons given by 

V e &(u,q) oc —-. (44.1) 

Taking the characteristic frequency of the phonons as the Debye frequency cj q « cdd > 
we deduce that the interaction is attractive for u ; < cjp. Leon Cooper showed more 
rigorously that an arbitrarily small electron-electron attraction would lead to pairing 
of electrons into bound states and that this pairing would occur between electrons 
across the whole of the metallic Fermi surface, making the entire metal unstable 
to the formation of pairs! Specifically, Cooper showed that there is an attractive 
interaction between two electrons with equal and opposite momenta and opposite 
spins. This is mediated by phonons and occurs chiefly between electrons within an 
energy uj-£> of the Fermi energy Ep. 


We will explore the consequences of the existence of these so-called 
Cooper pairs of electrons further by using an effective Hamiltonian 
given by 4 

H = ^^£ p Cp a c p( j — k 'y ( C-p|CpT, (44.2) 

per pk 

where cr(=t, |) labels the electron spin. This Hamiltonian describes 
the attractive potential energy between pairs of electrons, with opposite 
spins and momenta, whose energies will be taken in a shell of width ud 
at the Fermi surface. 5 

This Hamiltonian in eqn 44.2 is the model that was used to solve the 
problem of superconductivity by John Bardeen, Leon Cooper and Robert 
Schrieffer, known colloquially as the BCS model. Perhaps the most 
significant feature of the BCS model is not the Hamiltonian itself, but the 
method used to solve it. Schrieffer invented a trial many-particle state 
I^bcs)) known as the BCS wave function, which ingeniously captures 
the crucial physics. The BCS wave function is a coherent state whose 
p = 0 mode is macroscopically occupied when the pairs condense to form 


Peter Debye (1884-1966) 

Q 

The Debye frequency cjd is the max¬ 
imum energy that a phonon is allowed 
to take. Any larger and the phonon 
has a wavelength so small that it begins 
to explore length scales smaller than 
the distance between atoms and the no¬ 
tion of a phonon as a long-wavelength, 
collective excitation of a lattice breaks 
down. Phonons move at around the 
speed of sound while electrons veloci¬ 
ties are typically a factor of 10 2 larger. 


Leon Cooper (1930- ) 


^Comparing our notation from Chap¬ 
ter 42, one can identify ac 2 and 
5 

Note that this form of the poten¬ 
tial was motivated by the mean-field 
approach of Chapter 43. There we 
saw that the mean-field Hamiltonian 
for electrons in metals throws up terms 
such as a contribution to the energy of 
Co = {Cp^ q c\ +q ){c k c p ), which we dis- 
carded for an ordinary metal since or¬ 
dinary single-electron states give zero 
for each expectation value. This is not 
the case for Cooper pairs of spin- 4 elec¬ 
trons. 

John Bardeen (1908-1991) is the only 
person to have been awarded the Nobel 
Prize in physics twice. 

J. Robert Schrieffer (1931- ) 
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the superconducting ground state. It is used as a variational state with 
the model Hamiltonian to find the ground state energy of the system. 
We therefore start by building the BCS coherent state function. 

44.2 The ground state is made of Cooper 
pairs 

The building blocks of the superconductor are Cooper pairs. These are 
pairs of electrons with opposite spin and momenta. We can create a 
Cooper pair from the vacuum |0) using the operator 

pl = iApi- ( 44 - 3 ) 

The BCS coherent state is built from the vacuum using this operator as 
follows: _ 

i*Bcs>=n^ e ^i°>- ( 44 - 4 ) 

p 

The superconducting state is often described in terms of Cooper pairs of 
electrons in some sense ‘becoming’ bosons which subsequently undergo 
Bose condensation. This can’t be quite right, since the commutation 
relations for the pair operator aren’t those of a boson (see box). The 
pair operators do, however, commute with each other. Remember that 
the & p operators create fermions which obey the exclusion principle and, 
in particular, we have 

HH = CpT a -Pl 5 PT e ~Pi = °- ( 44 - 5 ) 


6 It’s important to notice that |^bcs) 
is very different from the usual situa¬ 
tion in a metal in which the wave func¬ 
tion is a single specific configuration of 
empty and filled states. The product of 
terms ( u p -\-v p Pp ) ensures that |^bcs) 
contains a coherent sum containing all 
possible configurations in which every 
pair state is filled or empty. The BCS 
state combines all possibilities in a sin¬ 
gle wave function. 

y 

See Exercise 44.3. 

^Similarly (N p i) = |u p | 2 . Of course, 
\u p \ 2 = 1 — \v p \ 2 tells us the aver¬ 
age non-occupation of the state (since a 
fermion state can either be unoccupied 
or singly occupied). 


Example 44.2 

The fact that (Pp) 2 = 0 gives us enough information to simplify the BCS coherent 
state. We notice that only the first two terms of the exponential can be nonzero 
(since they contain no powers and one power of P p respectively) and we have 

i4>bcs> -n<v^i°>=n c p i 1 + a p p i) i°)- ( 44 -6) 

V P 

We can also normalize the wave function: 


1 = |C P | 2 (0| (l + a* p p p ) (l + a p p;) 10} = \C P \ 2 (1 + |q p | 2 ), 
giving C p = (1 + |o p | 2 ) _ 2 . Defining 

1 a P 

Up — j-, Vp — j-, 

(l + |a p |2)2 (l+|a p |2)2 

we have that \u p \ 2 + \v p \ 2 = 1, we can write 6 

I’J'bcs) = ]T n p+ ,; p^ > p)i 0 )- 


Finally we can give meaning to v p . If we evaluate the ground state expectation value 
of the number operator for spin-up electrons, we obtain 7 

(JVpt) = <4'BCs|Cp T c pT |'I'BCs) = K’pl 2 ’ (44.10) 

which tells us that \v p \ 2 gives the average pair occupation of a state labelled with 
momentum p with spin up. 8 
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Written in terms of our single-fermion operators we have a simplified 
expression for the coherent state given by 

I^bcs) = + VpT 6 -Pi)l°)' (44.11) 

P 

The number operator N is given by 

N = E = E (Et + fipl) ■ ( 44 - 12 ) 

pa p 

Thus the total number of electrons in the superconducting state is found 
by summing over spin-up and spin-down occupancies and so 

(v&bcsI^bcs} = E (<^PT> + <^Pi>) = 2^ M 2 . (44.13) 

p p 

We now have enough ammunition to attack the variational problem of 
finding the ground state energy. 


44.3 Ground state energy 


We will now use the BCS ground state as a variational wave function for 
the model Hamiltonian (eqn 44.2) describing the superconductor. By 
minimizing the energy with respect to the parameters u p and v p we will 
extract the properties of the ground state of the superconductor. The 
expectation value of the energy is 


E = (4 , BCs|-ff I'E'bcs), (44.14) 

which, upon substitution, 9 gives us 

E = E 2e P lPp| 2 - k 2 E v * P v k u *k u p- (44.15) 

p pk 

We will minimize this subject to two constraints. The first fixes the total 
particle number via 

N = 2j2\v P \ 2 , (44.16) 

P 


9 The first term is easy since it is the 
kinetic energy term in the BCS Hamil¬ 
tonian (eqn 44.2). It can be written 

T epNpa, 

pa 

and so we can use eqn 44.13 to reduce 
it to 

2e p \v p \ 2 . 

p 

The second term is evaluated in the ex¬ 
ercises. 


which we impose as a constraint using a Lagrange multiplier 10 fi. We 
also introduce a Lagrange multiplier E p to enforce the constraint that 
\u p \ 2 + M 2 = 1. 


10 As will be obvious to those with ex¬ 
pertise in statistical mechanics, the La¬ 
grange multiplier p is the chemical po¬ 
tential. The Lagrange multiplier E p 
will turn out to give the quasiparticle 
energy. 


Example 44.3 

Using the method of Lagrange multipliers, we write a function / using 
f = E _ pN + Y / E p (\u p I 2 + \v p \ 2 - 1), 

P 


(44.17) 



404 Superconductors 




-4A -2A 0 2A 4A 



and look for solutions to df/du p = 0 and df/dv p = 0. This yields 
dE ON 

- p— -h E p u p = 0, 


du_ 
dE 
dv p ^ dv p 


du p 

dN 


- E p v* = 0. 


(44.18) 


Doing the derivatives we find, after a little algebra (examined in Exercise 44.5), the 
matrix equation 


(e p - p) A 

A* -(e P - p) 


•: I = e.„ 


where we’ve written A = k 2 u*v p . This matrix has eigenvalues 

E p = ± [{e p - p) 2 + | A| 2 ] 5 . 

Finding the eigenvectors is made easier if we define 


cos 28 p = £p E M , sin 2 8 P = 


Then we find eigenvectors 


(44.19) 

(44.20) 

(44.21) 

(44.22) 

(44.23) 


The behaviour of |« p | 2 and v p | 2 is shown in Fig. 44.3. We also find that u p v p = 
cos Op sin 0 p = ' sin 28 p = A/2 E p . We conclude that the BCS ground state is 

t it 


Up = 

COS dp , 

v; 

= sin0p, 

Up \ 

1 

“ 2 

( 1 + 

£ P ~ fl\ 

E P ) 

\ v p\~ 

1 

(- 

£p- p\ 

_ 2 

E v ) 


I^BCS) = n p (cose P +sine p c^ T cL pl )|0) 


We will see in the next section that A is the energy gap between the 
ground state and excited quasiparticle states. We may now eliminate 
Up and v p to reveal the physics of the BCS state. Using UpV p = A/2E p 
with the definition of the gap A, we find our first result which is that 


A 


A = k 2 V-. 

^ 2 E n 


(44.24) 


Fig. 44.3 (a) The function E p (which 
we will later call the bogolon disper¬ 
sion). (b) The functions \u p \ 2 and 
| r/p | 2 which show that well below fj, the 
state is predominantly electron-like and 
well above it is predominantly hole-like. 
Near to fi it has a mixed electron-like 
and hole-like character, (c) UpV p = 
A/2 E p , which has a maximum when 
there are plenty of electron states and 
available states to scatter into. 


Since the electrons that form the pairs only lie within of the Fermi 
energy, we can restrict our sum to |(£ p — /t) | < wd- We then make the 
replacement —» <7 (ef) / ds, where g(e p) is the density of states at 

the Fermi energy, and writing e = e p — fj,, eqn 44.24 becomes 

A = K 2 5(ep)/ de—— —r, (44.25) 

'- v - 'J-UD 2 {£ 2 + A -)2 

A 

where we have defined A (the effective coupling constant) using A = 
n 2 g(£ f). This integral gives 




(44.26) 
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but since A <C wd we obtain 

|A| « 2w d e _1/A . (44.27) 


This tells us how the energy gap that promotes superconductivity de¬ 
pends on the effective coupling and the Debye energy. Superconductivity 
is most robust if there is a large energy gap and so, if we want to find the 
most robust superconductor with the highest possible 11 critical temper¬ 
ature T C1 we need a material with a large Debye frequency and a large 
coupling constant A. The latter is achieved by having a large k and a 
large density of states at the Fermi energy. 


11 This assumes BCS-like electron- 
phonon pairing. A higher T c may be 
found using other mechanisms. 


44.4 The quasiparticles are bogolons 


We now make good on our claim of the last section: we want to confirm 
that A is indeed the energy gap in the particle spectrum of the super¬ 
conducting ground state. To find the excited states we employ a similar 
approach to that of the metal. We consider the model Hamiltonian with 
averaged pairs of operators in the interaction part: 

H = ^^(£ p — fJ.)Cp a C pcr — K ((Cp|cL p |)C-p|Cp| + (C-p|C p |)Cp|C^ p |^ , 

per p 

(44.28) 

where the expectation values are taken with respect to the BCS ground 
state |Decs)- Substitution of |Obcs) allows us to identify the quantity 
K2 Ep(^BCs|c-p|Cp T |fi B cs) as the energy gap 12 A = K 2 J2 p UpV p , and 
we have 

H = y ( (gp — fj,)Cp a Cp a — y ' (A c_p^Cp| + Ac p ^cl p j) • (44.29) 

per p 

We need to diagonalize this Hamiltonian, which contains cc and 
bilinears. As in the last chapter, the Bogoliubov transformation gives 
us a method to do this. 


12 

In manipulating these expressions in¬ 
volving the energy gap, note that if A 
is not real then, writing A = |A|e 1<?;> , 
we may always make it real with the 
global transformation di —► e 1<?f> / 2 Ci, 
cj —> In the next section 

i x 

we will confirm that choosing a particu¬ 
lar value for A corresponds to breaking 
symmetry. 


Example 44.4 

Written in matrix form we have the Hamiltonian 13 

-A 


* = £Ut 


-pi 


£p [l 

-A* ~(e p - p) 


C P ] 
£+ . 

“Pi 


(44.30) 


This may be diagonalized using the Bogoliubov procedure, whose details are exam¬ 
ined in Exercise 44.7, to give 


*-£K, 


-pi 


( E r 

° ^ 

( > T ) 

V 0 

-E p ) 

V ^ J 


(44.31) 


where the operators bp a and b pa create and destroy bogolon quasiparticle excitations. 
The bogolon operators obey anticommutation laws: 

|Spicri , } = ( ^PlP2^ cr l cr 2 5 (44.32) 

{fycqcri , bp 2( t2 } = 0, {> ^p 2 ^2 } = 0. 


13 This looks a lot like eqn 44.19 but 
with minus signs on the off-diagonal el¬ 
ements. 
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^This is key to superconductivity. Ex¬ 
citations will dissipate momentum and 
prevent the superflow of current. The 
presence of a gap prevents their cre¬ 
ation at low energies. 


M M t 

Miff 
M M t 

f f f t f 
M M t 

Fig. 44.4 Order in the superconductor 
involves the lining up of phase angles, 
just as in the case of an uncharged su¬ 
perfluid. 


The diagonalized Hamiltonian for the excitations in the superconductor 
is given by 

U — ^-pi k-p|)i (44.33) 

p 

where E p = \J(e p — p) 2 + |A| 2 . This is shown in Fig. 44.3. Notice that 
the bogolons have a dispersion relation that p starts at an energy A 
above the ground state energy. This is the energy gap: 14 there are no 
allowed quasiparticle excitations between E p = —A and E p = A. 


44.5 Broken symmetry 

As in the superfluid state examined earlier, superconductivity involves 
the macroscopic occupation of a coherent ground state Q). Recall that 
the order parameter for the superfluid was given by 

<fi|*(*)|n> = -^^(0|a p |0)e^ 

V p 

« —j=(fi|a p= o|H) = Vne' 8 °. (44.34) 

V V 

For the superconductor we have (by the same token) 

(n B cs\*(x)\n B cs) = ^=^(flBCs|M pi c pT I^BCs)e i(p - p) ' 3: 

V P 

= — 7 = (^BCS | C-pJ.C pT I ObCs) 

v p 

= y/ne ie °. (44.35) 

From these expressions we see that we could also regard the supercon¬ 
ducting energy gap as the order parameter for the system since we have 

A = k 2 ^(fi B cs|c-pj.CpTl^BCs)- (44.36) 

p 

Returning to the real space expectation value (ip(x)\i>(x)\’ilj(x)) = 
\Jp(x)e l9 ^ for a coherent state \ip{x)), we see that for the supercon¬ 
ducting ground state |Qbcs) has the phase field 9{x) breaking symmetry 
by becoming a constant 9 0 . (As in the case of the superfluid we can vi¬ 
sualize this with an W-model, as shown in Fig. 44.4.) As alluded to 
before, the difference between superfluidity and superconductivity comes 
from the fact that the phase symmetry that’s broken is a local one. Prior 
to breaking symmetry the superconductor enjoys local U(l) symmetry: 
we would alter the phase by 9(x) —» 0(x) + a(x), where a(x) is differ¬ 
ent at all points in spacetime and the Lagrangian describing the system 
would remain invariant. A gauge field is needed in the Lagrangian to 
guarantee this. We have seen in Chapter 26 that the gauge field will 
consume the Goldstone boson and become massive. This is the Higgs 
mechanism and occurs in superconductors! We will examine the Higgs 
phenomenon in superconductivity in the next section. 
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44.6 Field theory of a charged superfluid 

The superfluid Lagrangian is invariant with respect to the global trans¬ 
formation —> e IQ 'I'. We now promote this global symmetry to a local 
one. As we have seen, this requires the introduction of a gauge field A M 
through the replacement of the derivative d M by the covariant deriva¬ 
tive Dfj, = dp + i qA^. The coupling constant q is called the charge of 
the 4' field, so by making a local phase transformation we are studying 
the physics of a charged superfluid. Starting with the non-relativistic 
superfluid Lagrangian (and denoting the coupling by g = 2 Vk 2 ): 

C = i4' t <9 0 4' - — • V4- - -{n - 4' t 4') 2 , (44.37) 

2m 2 

we gauge the theory by introducing the covariant derivative D M . This 
will be easier in polar coordinates, so we introduce 4/ (x) = \Jp(x)e 10 ^ 
and = 1 A^=d fJl p + i^fp (d^d + qA^) e lS to obtain the gauged La¬ 
grangian 

C = - p(d 0 e + qA 0 )-A- ±-(Vpf+p(y9-qAf 

-f (44.38) 

where, as before, we drop the total time derivative term idop/2. Note 
that we have included the contribution of the gauge held via = 
d^A v — dvAfj,. The Lagrangian is now invariant with respect to the 
change 9(x) —> 6(x) + a(x) provided A^(x) —> A^{x) — ^d^aix). 

As we’ve seen before, when we break the symmetry we’ll find that 
the massless photon held A fJ/ (x) gobbles up the Goldstone mode 9(x) 
and acquires a mass. This is the Higgs mechanism, at play in a charged 
superfluid. 


15 n 

See Example 42.4 for the simpler ver¬ 
sion of this argument appropriate for 
the superfluid. 


^Remember, you don’t break the sym¬ 
metry of a Lagrangian; that always re¬ 
tains the full symmetry. We simply ex¬ 
pand the Lagrangian around the broken 
symmetry ground state. 


Example 44.5 

Here’s how it unfolds. 15 We notice that we can cast our Lagrangian in terms of a 
gauge invariant contribution C^{x) = ^d^O(x) + A^(x), and the contribution of the 
gauge field is = d^A v — d^A^ = dpC v — d u C^. We have 


£ = - pqCo - W Wp) + pq C \ - l (n - p) - i F ^ 

Now break the symmetry of the ground state. 16 We choose a ground state \I/o(#) = 
y/ne 1 ^ 0 , where we’ll make our usual choice of 0q = 0 as shown in Fig. 44.4. We 
expand about the ground state by examining small displacements yfp = \Jn + h, 
which yields 

C = -'^-C 2 - — (V/i ) 2 - 2 gnh 2 - 2 qy/hC 0 h - -F^F^, (44.40) 

2m 2m 4 

dropping the term —qnCo as we did for superfluids. Already notice that the C'-field 
appears to have acquired a mass (nq 2 /m) 2 . To complete our calculation we should 
integrate out the energetic field h to obtain a low-energy field theory. Just as in 
Chapter 42, we recognize that the h dependent part can be written in the form 


(44.39) 


-h -V' 

V 2m 


+ 2 gri \ h — ( 2qy/nCo)h . 


(44.41) 
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17 

See Chapter 13 if in doubt. 


Again, just as before, we will integrate out the field h with our path integral via 
— ^<j)K<j) + J(j) — > Setting </> = —y/2h and J = \Z2nqCo we have 


C = 


^2Slc 2 + qV^Co 

2m 


(—1/2 m) V 2 + 2(7n 


- -F^F^ 


1 


n 9 2 „2 -p p^ 

- - 4 ^ 

= ^ (c 0 2 - U 2 C 2 ) - 


(44.42) 


where v 2 = ng/m and we have neglected the term (l/2m)V 2 as small compared 
to 2 gn. This is a non-relativistic equation telling us that the fields in a charged 
superfluid give rise to excitations which are spin-1 vector particles. 17 


See Exercise 44.8. 


The consequence of the Higgs mechanism in a superconductor is the ex¬ 
pulsion of magnetic flux from the interior of the material. This is known 
as the Meissner effect. Starting with the low-energy Lagrangian from 
eqn 44.42 we recognize that we can eliminate the component Cq (since a 
massive vector field has only three degrees of freedom, see Chapter 13). 
We may therefore integrate out Co (in the same manner as above) with 
the result that 


C = -^C 2 
2m 


Terms involving 
derivatives of C 


(44.43) 


Notice that the three massive fields described by this equation have mass 
M 2 = nq 2 /m. 

Our low-energy Lagrangian in eqn 44.43 may be used to find the 
current in the superconductor, which is given by 

J = — (VO - qA). (44.44) 

m 

We conclude that the current in the superconductor responds to gradi¬ 
ents in the phase field (just like the superfluid) and also to the applied 
field A. 


This is one of the London equations; 
the other is 

dJ _ nq 2 

~dt ~ ’ 

which also follows from eqn 44.44 as¬ 
suming V0 has no time-dependence. 


Example 44.6 

Upon taking the curl of the current, we obtain an equation describing the magnetic 
induction field B in the superconductor: 18 


VxJ=-^B, 

m 

which, combined with the static Maxwell equation V X B = J, yields 

V 2 S= n 1 2 B 


(44.45) 


(44.46) 


In our units, the quantity nq 2 /m must have units (length) 2 , so we define A = 
(m/nq 2 )- 1 / 2 . 

In one dimension eqn 44.46 now reads 

d2 B = ±B 
dx 2 A 2 


(44.47) 
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whose solutions have the form B(x) = B(0)e~ x /^. In other words, the field in 
a superconductor falls away as we enter the interior of the superconductor over a 
length scale determined by A. Since A tells us how far magnetic flux can penetrate 
the superconductor it is known as the penetration depth. Our treatment has all 
been aimed at describing a charged superfluid. A superconductor is quite similar. In 
a superconductor we have Cooper pairs, so we conventionally set q = — 2|e| and write 
n s = 2 n and m = 2m* for the superfluid density and effective mass respectively. 
This gives a penetration depth 19 for a superconductor of 


1 _ n s e 2 
A 2 m* 


(44.48) 


^In SI units, 


p 0 n s e 2 

m* 


Chapter summary 

• The BCS wave function is given by I'E'bcs) = TIp( u p + 
w p4>T -pJ.)| 0) an< ^ a coherent state of Cooper pairs. 

• The quasiparticle excitations are bogolons, with dispersion given 
by E p = y/(e p - n) 2 + |A| 2 . 

• The superconducting energy gap A can be regarded as the 
order parameter of a superconductor, and is given by A = 
k 2 Sp(^BCs|c-pj.Cp T |fl B cs)- 

• The Higgs mechanism in a superconductor leads to the London 
equation, which can be written as J = — (V0 — qA). 


Exercises 

(44.1) Show that the amplitude 

(4 > BCs|Cp T cl pi |'l/BCs)(4/BCs|C-p|Cpt|4'BCs) 

(44.49) 

is nonzero. 

(44.2) Verify the commutator ^P p , Pjj = <5p q (l — JV p f — 

N- q |). 

(44.3) Verify (^ B cs|iVp T |^ BCS ) = \ Vp \ 2 . 

(44.4) Verify eqn 44.15: 

(E) = ^2 £ p|ap| 2 — k Y VpVkUkUp. (44.50) 

p pk 


(44.5) Starting with the total energy and total particle 
number written in the form 


e = J2 £ p (ki 2 - M 2 +1) - k2 J2 v p 


n = (i^pi 2 - m 2 +!) ■ 


pk 


VkUkUp , 


(44.51) 


derive eqn 44.19: 

(£p - n) A 
A* ~{e p -n) 


= E v 


(44.52) 

(b) Diagonalize the Hamiltonian and find |rtp| 2 and 
|up| 2 using the method suggested in the text. 


(44.6) (a) Define the Nambu spinor as i/) p = | 


pt 


-pi. 


and the energy gap as A = Ai — iA 2 . Show that 
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the BCS Hamiltonian can be written 

H = J2^ph p -Ti’ p , (44.53) 

p 

where h p = (Ai, A 2 , £ p — p) and r is a vector made 
up of the Pauli matrices. 

(b) Show that eqn 44.19 is identical to the Dirac 
equation upon making the replacements: 


ubov procedure. The transformation to use in this 
case is 



and the algebra is simplified by using the substitu¬ 
tions 


V'l 

IpR 

cr ■ p 

m 

This is the basis of Nambu’s analogy. 

(c) When local U( 1) symmetry is globally broken 
in the superconductor A takes on a nonzero value. 
What is the equivalent of this symmetry breaking 
in the Dirac problem? 

Nambu’s analogy is discussed further in Aitchison 
and Hey, Chapter 18. 

(44.7) We will find the excitations out of the BCS ground 
state by diagonalizing eqn 44.30 using the Bogoli- 


u p = cos 8 P , (44.56) 

iip = sin6* p . (44.57) 

(a) Check that the transformation maintains the 
anticommutation relations. 

(b) Verify that this does the job of diagonalizing 
the Hamiltonian. 

(c) Confirm that the bogolons are really the excita¬ 
tions by checking |Dbcs) = 0 and 6- p i|Hbcs) = 
0, that is, that there are no bogolons in the BCS 
ground state. 

(44.8) Show that the current in a superconductor is given 
by eqn 44.44. 


v > 

u„ 




"pi 
s p p, 

A. 


(44.54) 



The fractional quantum 
Hall fluid 


45 


The more success the quantum theory has, the sillier it looks. 
Albert Einstein (1879-1955) 

The application of a magnetic field to electrons in condensed matter has 
surprising consequences, none more so than the integer and fractional 
quantum Hall effects. The latter effect is a product of interactions, but 
there are some interesting features even in the non-interacting case which 
we review first. 

Throughout this chapter, it will be helpful to keep in mind that the 
relevant length scale in the problem is the magnetic length £bj given 



Note that 2 the flux $ through a circle with radius v^b is equal to 
<!> = B(2tt£q) = h/e. This is double the value of the magnetic flux 
quantum $0 = h/ 2e which is important in superconductivity (where 
the Cooper pairs have charge — 2e) and plays the role of a flux quantum 
for problems involved with unpaired electrons. It is often called the 
Dirac flux quantum. 


45.1 Magnetic translations 411 

45.2 Landau Levels 413 

45.3 The integer quantum Hall ef¬ 
fect 415 

45.4 The fractional quantum Hall 

effect 417 

Chapter summary 421 

Exercises 421 


^For ease of notation we follow conven¬ 
tion in condensed matter physics and 
write —e for the charge on the electron 
in this chapter. 

2 

The fact that we have to make the cir¬ 
cle have radius y/2Z& and not Zb is an¬ 
noying, but we will have to live with it. 
As we shall see, sometimes the relevant 
length scale is Zb and sometimes V2Zb • 


45.1 Magnetic translations 


Problems involving a lattice in solid state physics make use of transla¬ 
tional symmetry. Recall from Chapter 9 that the translation operator 
for a translation by a is e” lp a , where p is the momentum operator. For 
a periodic system we would expect e“ lpa to commute with the Hamilto¬ 


nian H. Writing U(a) = e lp a , we would then expect U(a),H 


= 0 or 


U~ 1 (a)HU(a) = H. This expectation is not realized in the presence of a 
magnetic field where we find that U (a) does not commute with H. This 
is because H has become a function of the magnetic vector potential A 
and this changes as you translate. We should therefore expect 3 


U~ l {a)H{A)U{a) = H(u~ 1 (a)A^ . (45.2) 


However, if the energy of the system is independent of global translations 
then this change in A can’t affect the magnetic held we would measure 
with a magnetometer. We have, of course, met such shifts of the vector 
potential before: they simply represent gauge transformations. 


q 

Treating the magnetic vector poten¬ 
tial as a function, rather than a second 
quantized field, we have U~ 1 (a)A(r) = 
e 1 ^ -0 A(r) = A(r — a). 
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^ Recall from Chapter 14 that a gauge 
transformation involves changes ip —> 
ipe 10L and A M — ► A ^ — -d^ct and we 
sometimes write \ = oijq. Note also 
that the covariant derivative D^ + 

i qA^ implies P = p — qA. In this chap¬ 
ter we will often deal with electrons, 
where q = —e. 


Example 45.1 

Consider 4 a uniform field B. In the gauge A(r) = lR x r we have 

U~ 1 («) A(r) = A(r — a) = A(r) — — B x a. (45.3) 

This is a gauge transformation A^ —> A^ — ^d^a(x) with JVafa;) = — ^B x a or 

I = x(r) = ~\(B X a) ■ r. (45.4) 

q 2 

Along with a shift in A /f . a gauge transformation also involves a shift of the matter 
field ip —» ipe la ^ which here is -0 —> ip exp [)f (-B X a) • r]. 


To summarize: in the presence of a magnetic field, we have a slightly 
unusual form of translational symmetry in that it is invariant under a 
combination of a translation with a gauge transformation. 

We saw in Chapter 14 that the momentum operator p M = id M doesn’t 
transform properly under gauge transformations, but the covariant ver¬ 
sion = iD^ = i(3 M + iqAn) does. It therefore comes as little surprise 
to find that an upgraded translation operator that does commute with 
the gauged Hamiltonian is given by 

U(a)=e~ ip - a , (45.5) 


^ Since we are discussing two- 
dimensional space here we will revert 
to the ordinary notation of vector 
components where momentum along 
the IE-direct ion is written p x and so on. 
We will return to four-vector notation 
in Section 45.4. 


^ This shows that the magnetic trans¬ 
lation operators form a non-abelian 
group. See Chapter 46. 


7 

The proof of eqn 45.8 follows from 


HKp*- 5 ! 4 ) 

•-T? (Pv + ^)] 

2iri(p 



and the use of the identity 

e^^eVe^s]. 


which holds if ^4, bJ commutes with A 
and B. See Exercise 45.1. 


where P = — iV — qA. 

To see some consequences of this formalism, we now return to con¬ 
densed matter physics and consider a two-dimensional square lattice 
(with lattice constant a) of non-interacting electrons in the presence of 
a magnetic field. The Hamiltonian is 5 

H = ^ (p x + eA x f + (Py + eA vf + V ( x > 2/)> ( 45 - 6 ) 

where V ( x, y) = V(x + a, y) = V(x,y + a ) is a periodic potential. 
With a uniform magnetic field parallel to 2 we can write the magnetic 
vector potential A = ^(— y,x,0). In this case the magnetic translation 
operators that translate us through one lattice spacing in the x- and 
y-directions are, respectively, 


(45.7) 

where we have restored factors of K. As advertised, these operators 
commute with H. However, they do not commute with each other and, 
in fact, 6 

uJjy = e 2 ^U y U x , (45.8) 

where (f> = ea 2 B/h is the dimensionless magnetic flux.' 


- 


i a 


eBy 

U x - 

= exp 

~~h 

I .Px - 

2 



i a 

(» 

eBx 

Uy = 

= exp 

~Ti 

l Py + 

2 
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The quantity <j) represents the number of flux quanta (h/e) passing 
through a square on the lattice (area a 2 , usually called a plaquette). 
Thus 

(45.9) 

where $ = Ba 2 is the magnetic flux through one plaquette. If <j) = p/q 
where p and q are integers then 


UlUy 


= o, 


(45.10) 


because (e 2,n<?i ) p = 1. This means that a rectangle consisting of q ad¬ 
jacent plaquettes contains an integer number of flux quanta. Thus, the 
system has an effective periodicity equal to q times what would be the 
case for B = 0. 

This idea is beautifully illustrated by the Hofstadter butterfly 8 
shown in Fig. 45.1, which shows the energy level spectrum for a two- 
dimensional tight binding model (recall Example 4.8) with a magnetic 
field perpendicular to it. When B = 0 (and consequently <j> = 0), the 
allowed energies satisfy — 4t < E < At as you would expect from eqn 4.50. 
For cf> = p/q, the allowed energies form q sub-bands (note that sometimes 
the sub-bands touch, so the two sub-bands at (f) = 1/2 meet at E = 0). 
The resulting band structure is fractal (and was published only a year 
after Benoit Mandelbrot coined the term). It has the extraordinary 
feature that the spectrum depends on whether <j) is rational or irrational 
(and can only be plotted for rational <f>). 


45.2 Landau Levels 


In order to examine the quantum mechanics of charges in applied mag¬ 
netic fields we return to the problem of two-dimensional free electrons. 
The Hamiltonian for this system may be written 


where 


- P 2 + P 2 

H = x v , 

2m 

(45.11) 

P x = Px + &-A-X 5 Py = Py + 6 Ay. 

(45.12) 


Classically, electrons with momentum p will orbit in circles of radius 
p/eB in a magnetic held. Consequently, we write the position operators 
of an electron as 


x = X + ±P V , y = Y — ±P X , (45.13) 

where (X, Y) are the coordinates of the guiding centre of the orbit. 
We find that (X, F) and (P x ,P y ) are independent coordinates because 


X. IL 


X,P„ 


Y , P r 


Y, P„ 


= 0 . 



4 


Fig. 45.1 The Hofstadter butterfly. 

^For more discussion of the Hofstadter 
butterfly see Douglas Hofstadter’s pa¬ 
per [Phys. Rev. B 14, 2239 (1976)] 
and his well-known book Godel, Es- 
cher, Bach , Chapter 5, which also fea¬ 
tures his take on renormalization. 

Benoit B. Mandelbrot (1924-2010) 
chose his own middle initial, which 
doesn’t stand for anything. There’s an 
old joke that says that the ‘B’ in ‘Benoit 
B. Mandelbrot’ stands for ‘Benoit B. 
Mandelbrot’, thereby making the name 
of the inventor of fractals a recursively- 
defined fractal. 


(45.14) 
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However, we also find that 


X,Y 


if 2 

“B! 


p p 

1 x i 1 y 


i hr 
HI 


(45.15) 


Thus the guiding centre cannot be determined more accurately than an 
area « i.e. the area occupied by approximately one quantum of flux. 


Example 45.2 

Operators X and Y commute with H and hence Yf- = -A- = 0. However, 


d P x 

d t 

1 

i h 

\Px, p] = —U)cPy, 


dPy_ 
d t 

1 

i h 

\Py, A] = Ul c Px, 

(45.16) 


where u c = eB/m is the cyclotron frequency. Thus (ignoring hats) 

P'x + u) 2 P x — 0, 

Py + uo^Py = 0, (45.17) 

and one can easily show that electrons perform circular orbits around (X, Y). 


As with many subjects in quantum field theory, the physics of this prob¬ 
lem may be described using the machinery of the simple harmonic os¬ 
cillator. To map this problem onto the simple harmonic oscillator we 
introduce the operators 


5 = fk (£■ - A) , s' = ^ (4 + iP„), 
'>=7fc(- t + ii> )’ S'= Tt (v - if) . 


(45.18) 


satisfying a, a) 



= 1 and 




= 0. The Hamiltonian 


then assumes the expected form: H = (d^d+ |) hu> c and so there are 
eigenstates | N), where N is the number of a-quanta in the oscillator. 
The energy levels of this system are called Landau levels. 

However, note that we can write the states 


|AT,n) = - ? i=(at) JV (Stno), (45.19) 

and so, even though the energy depends only on N, we shouldn’t forget 
about n. This contributes to the degeneracy and, in fact, from this we 
will now show that each electron occupies a real space area of tt£q. 


Example 45.3 

The density of states in momentum space is given by 


g(k)dk 


(2t r/Lfl 


(45.20) 
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where we write k = \k\ = ( k 2 + k 2 ) 2 5 the factor of 2 is for spin degeneracy and L is 
the linear size of the system. The radius kjy of the Nth Landau level in fc-space is 
found by writing 


H 2 k 2 N 

2 m 


= [N+ \ 


huj c , 


and hence 


k 2 - 
k n ~ 


1 + 2N 

—ft ' 


(45.21) 


(45.22) 


The area between two successive circles in /c-space tells us how much momentum 
space is taken up by each Landau level. It is given by 


AA k — ^{k 2 N+1 - k 2 N ) — -2", 


(45.23) 


where the factor of 2 again accounts for spin. We conclude that, in each Landau 
level, the number of states 9 is given by 

= _L^ 2 /2eB\ 


2tt 


1 


4 ( 2 Wi ) 2 


x 2 = —= L 

nil 


\ h y 


which has the form: 


Number of states in one Landau level = 


(Area taken up by all of them) 


(45.24) 


(45.25) 


(Area taken up by one of them) 

So the effective real-space area occupied by one Landau state within a Landau level 
is 7 r^g. 

However, we also need to discuss the effect of spin splitting. Each Landau level 
contains electrons in both spin states and a magnetic field will cause these to have 
different energies. The energy can be written 


E = (N+^)hw c ±^g* m B, 


(45.26) 


where g* is the effective g-factor. This effect will now double the number of levels 
(these are now spin-split half Landau levels) and each of these levels now has half 
the number of states in it. 10 The effective area occupied by one state in a spin-split 
half Landau level is then 11 2n£^. 


45.3 The integer quantum Hall effect 

The Hall effect depends on the number density of carriers and so our 
treatment of the integer quantum Hall effect will begin by considering 
how many states are available for occupation by electrons in a two- 
dimensional metal 12 when subject to a large magnetic field. 


Example 45.4 

A reminder of the physics of the conventional Hall effect may be found in Exer¬ 
cise 45.4. In brief, for an isotropic material Ohm’s law is written 


&XX 

&XV 


v xy 
& xx 


E x 

Ey 


(45.27) 


and the Hall coefficient is defined as Rn = (r xy /B. In experiments we measure the 
resistivities p X x and p xy , which are given by 

_ E x (7 xx 

pxx = 


jX 


7 2 
' XX 


+ (7 X 


pxy — 


7 2 
' XX 


+ cr x 
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^Thus the degeneracy of a Landau level 
is 2 eB/h = 1/) per unit area of the 
sample. 


^Thus the degeneracy of a spin-split 
half Landau level is eB/h = l/(27r£g) 
per unit area of the sample. 

Note that 2-7r£g also equals the area 
occupied by one Dirac flux quantum. 


12 r 

Recall from Chapter 34 that we ex¬ 
pect the conductance to vary as G(L) oc 
L(d — 2 ) ? where d is the dimensional¬ 
ity. For d = 2 we therefore ex¬ 
pect behaviour that is independent of 
the dimensions of the system, making 
the quantum Hall phenomena we will 
describe completely universal for two- 
dimensional systems. 


(45.28) 
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13 

See Singleton, Band Theory and 
Electronic Properties of Solids , for 
more detail on Hall effect physics. 


Fig. 45.2 The quantum Hall effect 
in a GaAs-(Ga,Al)As heterojunction, 
shown in resistivity components (a) p X x 
and (b) p xy , measured at temperatures 
between 0.03 and 1.5 K. (Figure from 
J. Singleton, Band Theory and Elec¬ 
tronic Properties of Solids, reprinted 
with permission.) (c) The fractional 
quantum Hall effect in a semiconduc¬ 
tor heterojunction, shown in resistivity 
components p X y (dotted line) and p xx 
(solid line). Figure from P. Gee, D.Phil. 
Thesis, Clarendon Laboratory, Univer¬ 
sity of Oxford (1997). 


^ Experimentally, this is accomplished 
by ensuring that electrons occupy the 
lowest sub-band in a quantum well in a 
semiconductor heterostructure. 


15 

The filling factor v counts the num¬ 
ber of spin-split half Landau levels. 
However, from now on we will simply 
refer to the spin-split levels as Landau 
levels. Whether the electrons interact 
with the magnetic field via their orbital 
(cyclotron) motion or via their spin an¬ 
gular momentum, the end result is the 
same: orbits with discrete energy lev¬ 
els; hence we will call these energy lev¬ 
els Landau levels, and v of them are 
filled. 


and in the high magnetic field limit we find \cr xy \ \cr xx \ giving 

pxx ~ <T f^, and pxy ~ -T = R h B. (45.29) 

a xy a xy 

For a conventional system we expect p xx to be a constant and p xy to increase linearly 
with field. As shown in Fig. 45.2, systems approximating a two-dimensional electron 
gas show something rather different with increasing field: 

• Plateaux in p xy occurring at values p xy = i with v an integer [Fig 45.2(b)], 

• Dramatic drops in p xx which takes very small values when we have a plateau 
in p xy [Fig 45.2(a)]. 

• Maxima in p xx in the transition region between the plateaux. 

This is known as the integer quantum Hall effect. 13 





The first step in studying the quantum Hall effect is to confine an elec¬ 
tron gas into a two-dimensional region. 14 A magnetic field is then applied 
perpendicular to this two-dimensional electron gas. As shown in Exam¬ 
ple 45.3 each spin-split Landau level has a number of states per unit 
area of sample equal to l/(27rtg). If the number of electrons per unit 
area is n, 2 d, then we define the filling factor v by 

(Areal density of electrons) 

(Areal density of states in a (spin-split half) Landau level) 

= n 2d 2 tt4 = (45.30) 

Be 

As a result, whenever the magnetic field takes a value 


n 2 dh 

e 


1 

_ ? 
P 


(45.31) 


where p is an integer, then the filling factor v will be an integer and 
hence v (spin-split half) Landau levels 15 will be completely filled. 
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This then has the interesting consequence that the system realizes 
a macroscopic quantum state in which there is an energy gap for the 
creation of all charged excitations. The system is an incompressible 
liquid, essentially because the energy of the system becomes indepen¬ 
dent of area when the Landau levels are full (i.e. integer v). In this case, 
we can use the fact that the Hall coefficient I?h = — l/(ri 2 de) to find 
that the Hall resistivity p xy is 16 

Pxy = RuB = — = 1 4 • (45.32) 

n 2 ae v e z 

Thus we find plateaux in p xy whenever v Landau levels are full. Re¬ 
markably, the resistivity p xy which is measured with a battery and some 
lengths of wire depends only on two fundamental constants of Nature. 
The quantity h/e 2 = 25812.8 O has been measured with incredible pre¬ 
cision and is often called the resistance quantum. 17 Equivalently, one 
can write eqn 45.32 in terms of the Hall conductivity a xy as 

e 2 

Vxy = v—. (45.33) 

Before feeling too pleased with ourselves we should remember that this 
analysis only tells us that p xy is quantized at those values of magnetic 
field at which an integer number of Landau levels are completely filled, 
but does not explain the occurrence of the plateaux nor the vanishing 
of p xx - In fact, these two features rely crucially on the fact that some 
of the states in a Landau level are localized via Anderson localization 
(described 18 in Chapter 34 ). The idea is that we imagine starting at a 
field where a single Landau level is exactly filled and then further increase 
the magnetic field. This reduces the degeneracy of the Landau level, 
forcing some electrons into the next Landau level at a higher energy. 
These electrons will end up in the localized states of that upper level 
and hence cannot contribute to the conductivity. As a result p xx = 0, 
while p xy remains at the quantized value corresponding to the complete 
filling of the lower level, resulting in a plateau. 


45.4 The fractional quantum Hall effect 

When v = 1 only a single Landau level is occupied and this maximizes 
the effect of electron-electron interactions. Although our treatment of 
the integer quantum Hall effect ignored these and treated the electrons as 
non-interacting, this will no longer be sufficient. The intriguing physics 
that is revealed when v < 1 is accessed by using magnetic fields with 
B > n 2d h/e. In addition to the plateaux observed at integer values of v, 
experiments in high-quality samples revealed further plateaux at frac¬ 
tional values of v [see Fig. 45.2(c)]. These extra plateaux are due to the 
formation of a state of matter which can support fractional excitations 
and which is descibed by a topological field theory of the sort we met in 
Chapter 30. 19 


1 To prevent a minus sign here, we use 
the convention that B < 0, so that 
(—e)B > 0. This convention is also fol¬ 
lowed in the discussion in Chapter 7 of 
Wen. 


17 

It is also known as the von Klitz- 
ing contant, after Klaus von Klitzing 
(1943- ) the discoverer of the quantum 
Hall effect. 


l^See Singleton for a simple explana¬ 
tion of how this works in this context 
or, for more detail, the article by J.T. 
Chalker in Ecole des Houches: Topolog¬ 
ical Aspects of Low Dimensional Sys¬ 
tems, Eds. A. Comtet, T. Jolicoeur and 
S. Ouvry. (1998). 


Although we haven’t stressed topol¬ 
ogy in our treatment of the integer 
quantum Hall effect, it may also be un¬ 
derstood in these terms. See Altland 
and Simons, Section 9.3 for the details 
of Pruisken’s field theory of the inte¬ 
ger quantum Hall effect and its link to 
topology. 
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20 

The original explanation of the FQH 
effect was made by Robert Laughlin 
based on a trial wave function ap¬ 
proach. We won’t pursue this method 
here. The interested reader should con¬ 
sult Wen, Chapter 7 for a more de¬ 
tailed description of the other possible 
approaches to describing the physics. 


21 

This is a form of emergence, where 
the properties and excitations of a 
phase of matter arise upon the conden¬ 
sation of that phase. 


22 A slightly more general, but basically 
similar, theory is needed to explain the 
others. Again, see Wen, Chapter 7 for 
the full story. 

23 

We stress again that these quasipar¬ 
ticles can’t be expected to be elec¬ 
trons with a shielded charge and in¬ 
creased mass as we had for quasipar¬ 
ticles in QED and in metals. In fact, 
the FQH quasiparticles will turn out 
to have quite different properties from 
electrons and holes, including fractional 
statistics and even fractional electric 
charge! 


24 Recall from Chapter 30 that in (2+1) 
dimensions we may write the magnetic 
field as B = d\A 2 — 82 A 2 and, follow¬ 
ing convention, we take B < 0 so that 
(-e)B > 0. 


We will assemble a quantum field theory that describes this fractional 
quantum Hall (FQH) effect. 20 This will be an effective theory in that 
we won’t start with details of the electrons and holes found in a metal, 
rather with a set of fields that will model the ground state and the low 
energy or elementary excitations. 

This is an important point. Nature doesn’t allow us the luxury of 
being able to derive the properties of complex phases of matter from first 
principles. In the same way that the rigidity of solids is not derivable 
from the physics of those atoms in a gas, so the FQH effect is a different 
phase of matter from the two-dimensional electron gas from which it 
is created. 21 The properties of the phase of matter that give rise to a 
fractional quantum Hall effect cannot, therefore, be derived from the 
electron gas in zero magnetic field. So while most phases are separated 
by a breaking of symmetry, as discussed in Chapter 26, this one is very 
different. The FQH phase is topologically distinct from the electron gas. 
The phase transition between electron gas and FQH state is therefore 
topological and cannot be described by a broken symmetry theory as we 
had for the magnet, for example. 

In the remainder of this chapter we will construct a quantum field 
theory of the FQH fluid, which is a phase of matter showing a FQH 
effect in its ground state. Specifically, we will explain the existence of 
a subset of the possible FQH states: those with \/v equal to an odd 
integer. 22 We will also examine the elementary excitations of the FHQ 
fluid, which are the quasiparticles of this phase. 23 

We begin by asking what we want from our theory. Obviously we want 
the theory to predict the fundamental facts of the FQH effect. These 
are: 

• In units where K = 1, we want to predict a conductivity a xy = 
J x /E y = |- + with 1/v taking odd integer values. 

• The FQH fluid is incompressible with a charge density given by 

0 ( Number of states per \ v 

P = -eJ FQH = -e x ^ Landau level J x 

e 2 B 

= < 45 - 34 ) 

where we’ve used the fact that the number of states per unit area 
L 2 in a Landau level is l/(27r£g). 

These two facts are embodied in an equation describing the electromag¬ 
netic FQH current 24 

2 

-eJ£ QH = -^e» vX d v A x , (45.35) 

where A M is the usual 17(1) gauge field of electromagnetism. We will cook 
up a Lagrangian whose equation of motion, at the very least, predicts 
this current. 

The Lagrangian which will do this is of the Chern—Simons form and is 
thus a topological theory. It will describe another 17(1) gauge field a M , 
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2 ^ We simply replace n in Chapter 30 
with 1/2-7T here. 

J cs = (45.36) 

The key to constructing the theory is to say that the FQH current in 
eqn 45.35 will contribute to the total Chern-Simons current. Our task 
is now to find a Lagrangian whose equation of motion tells us that the 
Chern-Simons current Jcs in eqn 45.36 is given by a contribution from 
the FQH current in eqn 45.35 added to a contribution from any source 
j 11 of quasiparticles that we put in the system. This is achieved with the 
Lagrangian 

C = -lf-e^ x a^a x + fe^ x A„d v a x + (45.37) 

Z Z7T Z7T 

where s is a number. This Lagrangian is formed from three terms: I: 
a Chern-Simons term; II: a term where the electromagnetic gauge field 
Aft couples to the Chern-Simons current; III: a term where a source of 
quasiparticles couples to the topological gauge field a M . 


which couples to the electromagnetic field A We know from Chapter 30 
that in a Chern-Simons theory the field gives rise to a current J£ s oc 
e^ x d v a\. We choose to adjust our normalization slightly 25 here and 
will define the Chern-Simons current as 


Example 45.5 


We can check that this works by feeding the Lagrangian through the Euler-Lagrange 
equation to extract the equation of motion. We find that 


'5(fe = ~ %e^d„A x , + j*. 

(45.38) 


We obtain 


—e^ x d v a x = —e^dvAx+j*, 

Z 7T Z7T 


(45.39) 


J CS = + V- (45.40) 

2 ixs s 

If we identify the dimensionless parameter s from the Lagrangian as being \ jv, and 
temporarily set the quasiparticle source = 0 then this is the fractional quantum 
Hall result we set out to predict. 


Our low-energy Lagrangian would be of little use if it didn’t provide 
some further insight into the workings of the fractional quantum Hall 
fluid. We will obtain these by asking about the quasiparticle excitations 
in the FQH ground state which are created by the conserved current 
Recall from Chapter 30 that coupling a current like to the topological 
gauge field constrains the properties of these particles, attaching the 
flux of the field to each particle created by and imbuing them wth 
fractional statistics. Specifically we will use the theory to show that: 

• the FQH plateaux appear for half odd integer s; 

• the FQH quasiparticles carry fractional electromagnetic charge; 

• the FQH quasiparticles have fractional statistics. 
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26 

Potential for confusion arises here as 
there are two gauge fields in this prob¬ 
lem: (i) the U(l) gauge field of elec¬ 
tromagnetism A^, which gives rise to 
the electric field E and magnetic flux 
density B , whose excitations couple to 
the charge —e; and (ii) the topological 
U( 1) gauge field a M , giving rise to an 
electric field e and magnetic flux den¬ 
sity 6, whose excitations couple to the 
charge l. The FQH quasiparticles carry 
integral a M -charge l and also A^-charge 
q , whose value we will determine. As 
the current of FQH quasiparticles is 
coupled to the topological field a^ we 
expect, from Chapter 30, that they will 
also carry flux of the 6-field (which we 
refer to as a^-flux). 


27 /~i 

In Chapter 30 we saw that the 
Chern-Simons charge density is related 
to the flux density via p(= J°) = — K,b 
where, for our quasiparticles p = l/s 
per unit area. Note also that k = ^ 
and we take 6 < 0. 


Our source of FQH quasiparticles will creates excitations with a M - 
charge 26 Z, where l is constrained to be an integer. We will construct the 
source term = laoS( 2 \x — xo), where we have chosen to localize 

the quasiparticle source at position xq. The zeroth component of the 
equations of motion predicts a total electric charge density of 

-eJcs = ~ x o), (45.41) 

where the first term on the right-hand side is the electric charge density 
expected from the FQH ground state and the second term is the extra 
electric charge of the quasiparticle. This equation implies two things: 
firstly that the FQH quasiparticle have electric charge — el/s ; secondly, 
returning to the properties of Chern-Simons theories from Chapter 30, 
we see that this excitation carries a^-flux of 2nl/s. This is progress, but 
we have yet to establish the values of s. 

Next we find the statistics of the FQH quasiparticles. We picture two 
such excitations with charges 1 1 and l 2 . Move one in a circle around the 
other and we obtain an Aharonov-Bohm phase: 


A$ = (cq, flux) x (quasiparticle charge) 

= 2tt- x l 2 . (45.42) 

s 

Now we set h = l 2 = l to obtain identical quasiparticles. Using the 
method of Chapter 30, 27 we note that exchanging these particles means 
rotating one through an angle ir around the other so that we expect a 
phase change mj, where 77 will determine the statistics of the quasipar¬ 
ticles. The corresponding phase change A<F is half that in eqn 45.42, 
implying that we have statistics given by 


77 = 


Z 2 

s 


(45.43) 


Our conclusions so far are that the excitations of the FQH ground state 
are quasiparticles with o M -charge of l, which carry an electric charge 
— el/s , carry a^-flux of 2n/s, and cause a phase change on exchange 
governed by 77 = l 2 /s. Since we know that v = 1/s, which experiment 
teaches us takes values such as 1/3, this suggests that the quasiparticles 
carry a fraction of an electron charge and possess fractional statistics! 
However we need to complete the puzzle by establishing the necessity 
for s to be an odd integer. 

In order to complete the description it turns out that we must appeal 
to the presence of electrons and holes in the metal and link these to 
our FQH quasiparticles. Following convention, we will call our FQH 
quasiparticle excitations quasielectrons and quasiholes since, despite the 
fact that we cannot adiabatically tune between the FQH quasielectrons 
and the quasielectrons of a metal, we hope that their properties can 
at least be related. To establish the relation, we note that a single 
electron has electric charge — el/s = — e, it therefore has l/s = 1 and 
carries l = s units of a M -charge. Its exchange property follows as rj = 
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s. However, we know that electrons are fermions and must therefore 
have an exchange angle of an odd integer. We therefore deduce that 
s = odd integer, which is exactly the property we wanted to explain. 
We may now conclude that the FQH quasiparticle has fractional electric 
charge —el/s and fractional statistics governed by 77 = l 2 /s. 


Example 45.6 

Let’s examine the case of s = 3. Electrons with l = s = 3 units of a^-charge form 
the FQH fluid with filling factor v = 1/3. The excitations of this liquid are FQH 
quasielectrons with a^-charge l = — 1 and quasiholes with 1 = 1. Quasielectrons 
have fractional electric charge —e/3, they carry a^-flux of 27 t/ 3 and have statistics 
governed by 77 = 1/3. 


Chapter summary 

• In a magnetic field the Hamiltonian is invariant under a combina¬ 
tion of a translation and a gauge transformation. This leads to the 
physics of the Hofstadter butterfly 

• The integer quantum Hall effect follows from considering the oc¬ 
cupancy of Landau levels as a function of applied magnetic field in 
a two-dimensional electron gas. 

• The fractional quantum Hall effect is caused by interactions and 
may be explained using an effective, topological field theory. 


Exercises 


(45.1) Verify eqn 45.8 using the method suggested in the 
text. 

(45.2) Verify eqns 45.14 and 45.15. 

(45.3) (a) Use the creation and annihilation operators de¬ 
fined in eqns 45.18 to diagonalize the Hamiltonian 
in eqn 45.11. 

(b) Evaluate X 2 + Y 2 to show that each electron 
occupies an area of 2tv£%. 

(45.4) A reminder of the Hall effect. 

Consider driving a current of density J x in the x- 
direction along a finite sized conductor, in the pres¬ 
ence of a magnetic field B directed along 2 . Con¬ 


sider the equations of motion of a charge in the 
relaxation time approximation 

YYIV 

mv = - h qE + qv x B, (45.44) 

r 

where r is the relaxation time. 

(a) In the steady state, show that E y /E x = —lo c t. 

(b) Verify that the Hall coefficient is given by 


I?h = 


y 

J X B 


1 

ne’ 


(45.45) 


where n is the electron density. 





Part XI 


Some applications from the 
world of particle physics 

Quantum field theory is foundational for particle physics, since particles 
are simply excitations in quantum fields. In this part, we review several 
important particle physics theories which use the ideas developed in this 
book. 

• Our treatment of gauge theory in Chapter 14 assumed that the 
underlying symmetry group is abelian (i.e. its elements commute). 
This is the case for 17(1), the group that describes electromag¬ 
netism. In Chapter 46 we introduce non-abelian gauge theory and 
show that the non-commutativity of group elements leads to a non¬ 
linear field tensor. 

• A non-abelian gauge theory is at the heart of the electroweak the¬ 
ory due to Weinberg and Salam and this is presented in Chapter 47. 
Symmetry breaking of the Higgs field results in the emergence of 
the photon, the charged W + and W~ particles, and the neutral 
Z° particle. The photon is massless, but the other three acquire 
mass and so the weak force becomes short-ranged. 

• Majorana fermions, particles that are their own antiparticles, are 
described in Chapter 48. 

• Chapter 49 introduces two varieties of magnetic monopole: one 
due to Dirac which provides a link between the quantization of 
electric charge and the quantization of the magnetic charge of a 
monopole; the other due to ’t Hooft and Polyakov which arises 
from a non-abelian gauge theory. 

• The final chapter, Chapter 50, introduces the concept of instan- 
tons, which allows us to treat problems involving tunnelling. We 
ask what would happen if the Universe were to tunnel from a false 
vacuum into the true vacuum. 




Non-abelian gauge theory 


Physics is very muddled again at the moment; it is much too 
hard for me anyway, and I wish I were a movie comedian 
or something like that and had never heard anything about 
physics. 

Wolfgang Pauli (1900-1958) 

QED is a very successful field theory built on the gauge principle which 
accurately describes electromagnetism. Applying this principle involved 
taking a theory with a global (internal) U{ 1) symmetry and promoting 
this to a local symmetry, necessitating the introduction of a gauge field 
A^(x). Turning our attention now to particle physics, we would also like 
to explain the weak and strong forces using fields. The weak and strong 
interactions can also be described by gauge theories, but unfortunately 
neither are based on the simple case of local U{ 1) symmetry. In fact, 
both are based on local symmetry under more complicated groups of 
transformations: for the weak interaction it is the group U( 1) (g> SU(2) 
and for the strong interaction it is SU(3). Unlike U( 1), these groups are 
non-abelian. 

A non-abelian group is formed from elements that don’t commute: a 
transformation by element a followed by one by element b has a different 
effect to the transformation by b followed by a. Non-abelian groups 
are quite familiar from everyday life, the rotation group SO( 3) being 
a simple example. Figure 46.1 shows that a rotation R\ of a book by 
7t/2 about x followed by rotation i? 2 by 7r/2 about 2 leads to a different 
final orientation of the book from the rotations f? 2 then R\. In this 
chapter we examine the features of a gauge theory with a local non- 
abelian symmetry. We will concentrate on the simplest possible non- 
abelian cases SU( 2) and SO( 3). We start by reviewing the important 
properties of an abelian gauge theory. 

46.1 Abelian gauge theory revisited 

In Chapter 14 we looked at the simplest possible gauge theory. Fun¬ 
damentally it is a theory with a local symmetry. It is symmetric with 
respect to U( 1) transformations which are different at every point in 
Fig. 46.1 A demonstration that the el- s P ace time - U(l) is abelian since two transformations, applied one after 
ements of the non-abelain group SO( 3) another, always lead to the same result, no matter the order in which 

don’t commute. they’re applied. As a reminder, here is the complex scalar field La- 


46 
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non-abelian gauge theory 
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Niels Hendrik Abel (1802-1829), af¬ 
ter whom abelian groups are named, 
died aged 26 from tuberculosis. Non- 
abelian group theory was introduced by 
Evariste Galois (1811-1832) who was 
killed in a duel aged 20, having spent 
the entire night before writing down a 
revolutionary theory involving the use 
of groups to investigate the solubility of 
algebraic equations. 
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grangian: 

C = (D^Y(D^) - mW> - (46.1) 

where D^ip = {d^ip + i qA^i/j). This is a locally gauge invariant quantity 
under the simultaneous transformations 

tp —> ipe ia ( x ), —> ^d^a(x). (46.2) 

In order to make the jump to more complicated symmetries we will deal 
with these in the form of infinitesimal transformations. 


Example 46.1 

We’ll replay the proofs that the Lagrangian is invariant with respect to local gauge 
transformations using infinitesimal transformations. 

(1 + i a)xp, 

d^xp + i(9 M a)V> + ia^d^xp), 

~ -dpafa). (46.3) 

<? 

This works up to order 0(a) because we have 

xp^xp —> (xp^ _ iaxp^)(xp + iaxp) = xp^xp + O(o 2 ), (46.4) 



and 


D 


dpip + i(d^a)xp + ia^d/j.tp) + i q {xp + i axp) 

[d^xp + iqA^xp] + i a [d^xp + i qA^xp] + 0(a 2 ) 

(1 + io)D M V’ + 0(a 2 ). (46.5) 


Equation 46.5 shows that D^xp transforms exactly the same way as ip. This, in turn, 
means that (D^xp)^(D^xp) —> {D^xp)^ (D^xp) + 0(« 2 ). So all of the terms involving 
xp are invariant with respect to the transformations. 


46.2 Yang—Mills theory 


In 1954 Chen-Ning Yang and Robert Mills asked if the theory of local 
symmetry could be extended to more complicated groups of transfor¬ 
mations than (7(1), such as £(7(2) and SO{ 3). These groups are non- 
abelian. Although an SU (2) gauge theory doesn’t describe the weak or 
strong interactions, it does display the properties of a more complicated 
non-abelian theory and so we will consider it in this chapter. 

We ask what needs to be done to make a theory with a global internal 
SU( 2) symmetry also symmetric under local internal SU( 2) transfor¬ 
mations. We’ll use exactly the same strategy we used to upgrade the 
global internal (7(1) symmetry to a local one. We will first work out the 
consequences of imposing a local symmetry and then we will add a held 
which cancels out any extra terms that arise. 


Chen-Ning Yang (1922- ) 

Robert Mills (1927-1999). 

A gauge theory based on the SU(N) 
group is called a Yang—Mills the¬ 
ory in honour of Yang and Mills who 
extended gauge theory to non-abelian 
groups in 1954. 
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Here’s the Dirac Lagrangian for two sorts of fermions, each with mass 


^We use t for the Pauli matrices 
here, rather than cr, to emphasize that 
they act on our states of two sorts of 
fermions, 4 f = ( f g ), rather than 
on the spin of a single fermion, i.e. they 
are Pauli ‘isospin’ matrices, rather than 
Pauli ‘spin’ matrices. 


Noether’s theorem: 
nal symmetry 

SU (2) inter- 

D a V = |r a 4' 

(a: internal) 

n£ = Wry" 

(g: vector) 

DC = 0 

W* = 0 

J&= 

<?Nc = /d 3 xl4'tT“'L 



As examined in the exercises, I z = 
| f d 3 x (/t/ — gt g). We might say that 
the 4/-field carries I = 1/2 units of 
isospin, with /-on particles correspond¬ 
ing to I 3 = 1/2 and g -on particles to 
I 3 = —1/2. Isospin will be important 
in the next chapter. 


m: 

£ = /(i7 M d/i - m)f + 9( 17 ^ ^ TO )fo (46-6) 

which we write more compactly as 

C = 'I , (i7 A1 <9 M — to)'!', (46.7) 

where 

*=('), 4' = ( / g). (46.8) 

This Lagrangian is invariant under a global SU (2) transformation 

$-4e5 T “$, vp -> (46.9) 

where r = (t x ,t v ,t z ) are the Pauli matrices. 1 The Noether currents 
arising from this symmetry are given in the box. The conserved charge¬ 
like quantity for this theory is the isospin I = f d 3 # 4^ 

To investigate further we’ll be working with infinitesimal transforma¬ 
tions, so in infinitesimal form we can write 

T —> (l + X -t ■ a) T. (46.10) 


Example 46.2 

We can show that this is a global symmetry of our Lagrangian. As before we write 

— T • CK J ( 1 H- T • C* J 4/ 

2 J \ 2 J 

= '£<!' [l + 0(« 2 )] . (46.11) 



Since we’re working with infinitesimals, we agree to jettison any term above first 
order. Since, for a global symmetry the derivative £^4' transforms exactly as does, 
(since a has no dependence on spacetime) the entire Lagrangian is invariant. 


Now for the pivotal point of this chapter. We upgrade this argument to 
a local transformation by letting a be a function of spacetime x. This 
is no problem for the mass term in the Lagrangian ’Lm.'I', which is still 
invariant. The trouble comes from the derivative, since 

-> d ^ + 1 [r • a(x)\ d ^ + 1 [r • d^a(x)} T. (46.12) 

and the final term in eqn 46.12 prevents d ^4/ transforming like ’S' and 
2 Remember that the 17(1) version of therefore prevents the derivative term from being invariant. 2 
this is df,i> — > d ll ,'il>+ia(d ll ,4>)-\-i(d ll a)'ip. Just as we introduced the gauge field A M ( x) for the 17(1) case, we 

need to introduce a new gauge field which will mop up the term in¬ 
volving d l _ l a(x). This gauge field is called W^x) and has three internal 
components [that is (W^{x), W^(x), W^(x))], each of which have four 
components in Minkowski spacetime (labelled with the index /r). Just 
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as before, the new field is combined with the derivative operator to make 
a covariant derivative. We therefore define a covariant derivative 3 

Dp=8p~ l -gr ■ W^x), (46.13) 

where g is the charge of the theory (it’s the quantity which will even¬ 
tually tell us how strongly the gauge field Wp interacts with T). If 
this procedure is going to work, then D^ must transform in exactly 
the same way as \f r does. That is, the manner in which the gauge field 
transforms must cancel out the extra term in the transformation of the 
ordinary derivative. 4 This will indeed be the case, but the fact that the 
transformation is non-abelian leads to a new complication. 


Example 46.3 

We want the derivative to transform in the same way as 4/, which is to say that it 
should transform as 

D ^ —> (l + • a(x)j D^. (46.14) 

This implies 

Dp\i -> (l + ^' r cx ( x )^ ^9T- Wk(x)'I') 

= dpty - ^g-r ■ Wp(x)'& + ■ ct(x)dp,y 

- (^) 9[' r ' «(*)] [t • Wp(x)^\ . (46.15) 

So how do we get it to work? We suppose that the covariant derivative transforms 
as 

Dp^ dp - l -gT ■ Wp(x) - ^gr ■ SWp(x), (46.16) 

where <514^ (x) is the change in the gauge field W^(x) with the transformation. We 
also know that the field transforms as ^ —> [l + • cx(x )] 4/. Combining these 
transformations shows that 

DpV -*• (dp- \gT Wp{x) - ^gT ■ 8Wp{x)^j (l+ l -T ■ 

= dpty + ^t • [dpa.{x)\ 4> + • oc(x)dp’& 

-y gr ■ YVp(x)-H - Q) g [t ■ Wp(x)\ [t ■ ot(x)] 4 1 

— ~gT ■ 5W^(x)^, (46.17) 

ignoring the aSWjj, term as it’s second order. 

By comparing terms in eqns 46.15 and 46.17 we find that for our gauge field to 
work we require 

T ■ 8Wp{x) = -T ■ [dpa{x)} + ^ {[t • Oi(x)} [t ■ Wp(x)} - [t ■ Wp(x)} [r • <x(x)]} • 

9 2 

(46.18) 

Notice that this last term survives only because the transformation is non-abelian 
and the two transformations involve t entering in different orders. Finally, we can 
use the identity 

(t ■ cl)(t ■ b) = (a ■ b) it ■ a x b, (46.19) 

and conclude that 

T • 5W^(x) = —t • [d^oc(x)\ — t • [ck(x) x Wn(x )]. (46.20) 

9 


^ Recall that for U( 1) we have D^ = 
d[i + i qA^. The equation here is for¬ 
mulated analogously with, by conven¬ 
tion, a sign change reflecting the fact 
that our chosen SU( 2) rotation e 2 T a 
rotates in the opposite (internal) direc¬ 
tion to the 1/(1) rotation e ia . 


4 Note that, 
plicitly, the 

tive 


written out ex¬ 
covariant deriva- 
dpf 
dpg 


2 9 


w p 
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5 For comparison, in the t/(l) case this 
transformation is A M —> A M — ^d^a. 


5 This is a surprising result: the field 
strength is a nonlinear function 

of the gauge field W^. We thus have 
found ourselves a nonlinear interacting 
theory, even in the absence of matter! 


The result of the preceding example is that the gauge field must 
transform 5 according to 

r • W„ - r • Wp + -t ■ (0 M a) - r • (a x Wp). (46.21) 

46.3 Interactions and dynamics of W fi 

Next we ask how the gauge field couples to the fermion particle field \E r . 
Minimal coupling, which we first met in Chapter 14, tells us that this 
is simply a matter of multiplying out the term in the Dirac Lagrangian 
involving the covariant derivative D We have 

4dy = 4'i7 M <9 /J 4' + |'Fy^r • . (46.22) 

Compare this to the U(l) version 

- # 7 ^(46.23) 

which told us that the excitations in the U(l) gauge field A^(x) mediated 
interactions between electrons by coupling to the charge q. Furthermore, 
the Feynman rule for the interaction was that a *F'FA M interaction vertex 
contributed a factor —i< 77 ^. By analogy, the SU( 2) version suggests that 
the Feynman rule for a 'F'FTy i vertex is the contribution of a factor 
proportional to h/ry^. After quantization, there will be excitations in 
the Wfj, gauge field which mediate the interactions between the fermions 
of the theory. Explicitly, we expect there to be massless W 1 , W 2 and W 3 
photon-like particles, each with two possible transverse polarizations. 
The particles couple to the isospin of the 'F field with a strength set by 
the charge g. 

When we looked at the f/(l) theory, an additional, but very impor¬ 
tant contribution to the Lagrangian was provided by the gauge field 
Afj, ( x ) itself, whose dynamics gave us electromagnetism through the term 
— ^F IXV F IJ,V . We now ask what contribution the gauge field W^(x) makes 
to the Lagrangian. An obvious guess would be that we should add a term 
of the form — | ■ G^ where G MI/ is a tensor analogous to F^ in the 

electromagnetic case. The obvious guess for the form of this tensor is 
but since we will want G to transform like 
and d^, it turns out that the tensor we need is 6 

G„ v = d^W v - dvWp + g(W^ x W v ). (46.24) 

Our final expression for the Lagrangian of a locally invariant SU(2) 
theory is then given by 

C = (iy- m)-F - l -G „„ • . (46.25) 

The incredible thing about this Lagrangian is that the nonlinearity of 
Gfj, v allows a new sort of process to take place, namely, the direct in¬ 
teraction of particle excitations in one of the fields with other 
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particles, as shown in the interaction vertex in Fig. 46.2, where we have 
three particles interacting. (The Lagrangian also allows the inter¬ 
action of four Wfj, particles. 7 ) Note that photon-photon interactions are 
not possible in abelian electromagnetism as the photon carried no elec¬ 
tromagnetic charge of its own. In contrast, the isospin field carries 
an isospin charge of one unit (/ = 1) so remarkably it may act as a 
source of itself. 8 

This self-interaction of the gauge field provides an explanation for a 
feature that makes Yang-Mills theories so interesting: the asymptotic 
freedom described in Chapter 34. As we probe further away from an 
electromagnetic charge (described by abelian 17(1) electromagnetism) 
the charge is shielded by dipoles formed from virtual electron-positron 
pairs and the electric charge appears to get smaller. However, the non- 
abelian gauge held produces a marked difference as we probe further 
away from a quark (where the strong interaction of colour charge is 
described by a non-abelian SU( 3) gauge theory). The screening effect 
would be the same for the quark as for the electron, except that we have 
the gauge held self-interaction which allows the gauge held quantum (a 
gluon ) to decay into a pair of gluons. As the gluons themselves carry 
colour charge they lead to a proliferation of like colour charges around 
the quark leading to an increased colour charge at large distances. This 
is asymptotic freedom: the interaction gets stronger as we probe further 
from the quark. 


Example 46.4 

A free, globally ,S'0(3) invariant scalar field theory was introduced in Chapter 13. Its 
Lagrangian is 

1 m 2 

C = • •£. (46.26) 

where 4? = (o\. <f) 2 , 4> 3 ). This theory is invariant with respect to the global (internal) 
rotation given by 4>(x) — > 4?(a:) — 6 X <I> (x). Again, we upgrade this to a local 
transformation by letting 6 , the internal angle of rotation, be a function of spacetime 
x. That is, the transformation is now 4?(x) —> 4? ( x ) — Oix) X 4 > (.r). To make a locally 
invariant theory we need a gauge field, leading to the covariant derivative: 

D p O = 9 )j 4'-9$xH{ 1 . (46.27) 

The correct Wf to choose is one that transforms as 

W^^W^ + -d li e-(exW^, (46.28) 

9 

where g is the charge of the theory, and as for SU (2) we add a term — ^G^ u • G^ u to 
the Lagrangian in eqn 46.26, where for this theory G/j U = d^Wv — d t /Wf + X 

W£). This theory describes three massive scalar fields (the three components of 4>) 
and three massless gauge fields (the three internal components of W^). 


This completes our discussion of the properties of a non-abelian gauge 
theory. 9 As a summary, the gauge fields that we have met so far are 
listed in the margin on page 430. 


The terms leading to the interactions 
of three and four particles can be 
seen by expanding G^ • G^ u . 


® Since the algebra of isospin mirrors 
that of ordinary spin, we say that the 
excitations of the 1 = 1 field are 
particles with I z = 0 created (and 
destroyed) by the field VPj; particles 
with I z = 1 created by the combi¬ 
nation -^=(W^ + [and destroyed 

by ^=(WjJ — iW^)]; and particles with 
I z = — 1 created by [and 

destroyed by + i W 2 )\. 



Fig. 46.2 The interaction of three W 
particles is possible in a gauge theory 
with local SU(2 ) symmetry. 


9 The next logical step would be to 
quantize the Yang-Mills Lagrangian. 
This can be done using the path inte¬ 
gral but requires some tricks that we 
won’t pursue here. See Peskin and 
Schroeder, Chapter 16 for the details. 
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(1) Abelian U( 1) gauge theory for 
the complex scalar field 'ip: 

where 

D= d^ip + iqA^x)^ 

F'fj.„ = d^Av - dvAfj,. 

The symmetry is 

i> -> ip e ia< - x) 

and the gauge field transforms as 



g 


(2) Non-abelian SU{ 2) gauge the¬ 
ory for fermions described by a 
field T: 

c = Vii^Du - m)V - ■ G^ 

where 

D^ = d^- '-g-r- V^(x)W 

G„ v = d^W v -d v W IL 
+g{Wu x W v ). 

The symmetry is 

\J> —> \^q^ tol ( x ) 

and the gauge field transforms as 

W^-^W I1 + -d^a -(ax W„). 

9 


46.4 Breaking symmetry with a 
non-abelian gauge theory 


We saw in Chapter 26 that spontaneous symmetry breaking in a gauge 
theory led to the famous Higgs mechanism. Here we treat exactly the 
same problem with the more complicated example of a non-abelian 
field. This is useful, not only because it provides yet another exam¬ 
ple of the wonderful physics of symmetry breaking, but also because the 
electroweak theory examined in the next chapter relies on exactly this 
physics. 

We will examine the case of SO{ 3) gauge theory with an </> 4 -type in¬ 
teraction and, crucially, a positive, symmetry-breaking mass term, with 
a Lagrangian given by 

I 777 ^ 1 

C = _£>/*$ ■ D^<f> + — $ • $ - A($ • $) 2 - -G^ ■ G^. (46.29) 


The potential possesses a spherical shell of minima at a radius (in in¬ 
ternal $ space) of |$ 0 | = (fy)'- Symmetry breaking involves picking 
one of the infinite number of equivalent vacua, which breaks global (and 
local) symmetry. For simplicity, we chose the one that points along the 

3-direction in isospin space, that is = (qx) 2 ®3- F° r the excitations 
above this ground state, we write 


/ $,(*) \ 

$ = $ 2 (*) , (46.30) 

\ l$o| + x( x ) ) 

where x( x ) describes deviations in the 3-direction. Then, after a bout 
of tedious algebra, we find 


(3) Non-abelian 50(3) gauge the¬ 
ory for real scalar fields <i>: 

1 m 2 I 

C = -DP&-D U & - & &--G uu 

2 2 4 

where 

D^ = d^-g&xwp 

G llv = 

+9(W x W v ). 

The symmetry is 

$ -» $ - 0(x) x $ 

and the gauge field transforms as 

W w + -d„e- (ex w„). 

9 


£ = \ [(^$i) 2 + (d^ 2 ) 2 + {d^x)' 2 ] - 4|^ 0 | 2 Ax 2 
+5l*ol [{d^Wl - (d^ 2 )wp\ 

+ d|T [ (W H) 2 + (W 2 ) 2 ] - \g^ ■ + ..., (46.31) 

where we ignore higher order terms. The first line describes the dynamics 
of the xi x ) scalar field, the final line the dynamics of the W^(x) vector 
field, but the terms that are mixed in components of $ and aren’t 
easy to interpret. However, there is a useful trick we can use: we select 
a gauge which ensures that the excited state field 3? points along the 
3-direction in isospace at every point in spacetime. (This is known as 
unitary gauge.) This removes all mention of the components 4>i and 
$2 and means that the excited state field becomes simply 


®(x) = [|$o| +X(a0] e 3 - 


(46.32) 
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Example 46.5 

Since the term = <9^4? — </4> X mixes up the components of 3>, our choice 


of unitary gauge yields 



i 

= gW 2 (|<t> 0 |+x), 


D^2 

= -gW'{\* o|+x), 


O fJ ^3 

= d^x, 

(46.33) 


giving, 

(D m *) 2 = (<9 m x) 2 +9 2 |^o| 2 [(Wlf + (W 2 ) 2 ] + ... (46.34) 

This solves the problem of the mixing of components of & and fields. 


The broken symmetry Lagrangian is now given by 

c = ^x) 2 -4|* 0 |Ax 2 

+ [(^) 2 + (W 2 ) 2 ] - \g„ v ■ G» v + ..., (46.35) 

which is the sum of Lagrangians for a massive, single-component scalar 
held x and a massive vector held with two components Wj t and W 2 . 
Symmetry breaking has caused two massive scalar helds (<f>i and $ 2 ) to 
disappear from our original Lagrangian and we are left with one, called 
X- The vector helds It')) and W 2 have grown massive, taking on a mass 
5|<& 0 | ( see Fig- 46.3). The held W 0 remains massless. 

In summary, symmetry breaking of the 50(3) gauge theory causes: 


( 3 massive scalar fields $ 

3 massless photon-like fields VP)) 


1 massive scalar field 


X 


2 massive vector fields H , 

A 4 

1 massless photon-like field 



We may check that we haven’t lost any degrees of freedom. Massive 
scalar helds only have a single degree of freedom, while massless photon¬ 
like helds have two. 10 This makes nine on the left. Noting that massive 
vector helds have three degrees of freedom we see that we also have 
nine on the right and all is well. Notice, finally, that we could describe 
electromagnetism by taking Wj^(x) = A /1 (x). This immediately leads to 
the question of whether electromagnetism in our Universe results from 
breaking symmetry in an 50(3) or 517(2) non-abelian theory. 

Is all this non-abelian gauge theory simply a mathematical exercise 
to amuse the theoretically inclined? On the contrary, it seems that the 
Universe has some non-abelian symmetries etched into it. In the next 
chapter we will apply what we have learnt about symmetry breakdown 
in a non-abelian gauge theory to formulating the electroweak theory 
of Steven Weinberg and Abdus Salam, and we will discover that the 
symmetry group we break is a little more complicated than 50(3). 


X 




Fig. 46.3 The gauge fields and 
W? eat the 4> Goldstone modes and 

A* 

acquire mass when global symmetry is 
broken in a non-abelian gauge theory 
with local SU{ 2) or SO (3) symmetry. 

^ Remember the case of electromag¬ 
netism! 
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Chapter summary 

• A non-abelian group is formed from elements which do not com¬ 
mute. The introduction of a non-abelian local symmetry leads to a 
non-abelian gauge field. The field tensor has an extra term which 
makes it nonlinear. 

• The consequences of symmetry breaking were demonstrated for 
SO( 3) theory. 


Exercises 


(46.1) Show that I z = | f d 3 x(p f — gUg). 

(46.2) (a) Write an SO( 3) rotation about the 2 -axis in in¬ 
finitesimal form. 

(b) Show that the Lagrangian in eqn 46.26 is invari¬ 
ant with respect to global SO( 3) transformations. 

(c) We want to arrange matters so that the theory 
is invariant with respect to local SO (3) transforma¬ 
tion. As in the abelian case, the trouble comes from 
the derivative term. Show that 

<9 M 3>(a;)—>d M $(*) — d M 0(x)x$(a;) — 0(x)xd fl ^(x). 

(46.36) 

(d) Verify that D M <1? transforms in exactly the same 
way as <f>. 


(46.3) (a) Show that, for (7(1) gauge theory, the commu¬ 
tator of the covariant derivatives gives 

[£>**, D v \ = i q(d f ‘A v - d"A M ) = i qF^. (46.37) 

This suggests that we can extract the analogue of 
for non-abelian gauge theory by evaluating the 
commutator. This turns out to be the case! 

(b) Show that the analogous commutator for 5(7(2) 
gauge theory is 

[D M , D v ] = \gr ■ [d^W 1 ' - d v W' 1 + gW 1 * x W v ) 
= ^grG^. 

(46.4) Verify eqn 46.31 and the simplified form eqn 46.35. 
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As theorists sometimes do, I fell in love with this idea. But as 
often happens with love affairs, at first I was rather confused 
about its implications. 

Steven Weinberg (1933- ) on symmetry breaking 


I remember travelling back to London on an American 
Air Force (MATS) transport flight. Although I had been 
granted, for the night, the status of a Brigadier or a Field 
Marshal -1 forget which - the plane was very uncomfortable; 
full of crying servicemen’s children - that is, the children 
were crying, not the servicemen. I could not sleep. I kept 
reflecting on why Nature should violate left-right symmetry 
in the weak interactions. 

Abdus Salam (1926-1996) 
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The Universe as we know it is the result of a symmetry breaking phase 
transition. In this chapter we will explain the features of a Universe in 
which: 


• Electrons have mass, but neutrinos do not. 1 

• Electrons may occur with left- or right-handed chirality, neutrinos 
are only observed with left-handed chirality. 

• The photon is massless. 


1 Of course neutrinos are not really 
massless, as shown by neutrino flavour 
oscillation experiments. Their mass is, 
however, undoubtedly small compared 
to the electron mass and in this chapter 
the Universe we describe is an approx¬ 
imation to ours. 


The explanation of these fundamental properties emerges from a model 
formulated by Abdus Salam and Steven Weinberg which unites the elec¬ 
tromagnetic and weak interactions. At the heart of the Weinberg-Salam 
electroweak theory lies a proposition: we live in a broken symmetry 
Universe and the particles that we observe in Nature are a result of 
the symmetry breaking. 2 In order to make progress with this idea we 
will write down a Lagrangian for the electroweak Universe before this 
symmetry breaking phase transition took place. We will propose a set 
of local symmetries for the Lagrangian. As usual, this will require that 
we introduce gauge fields to force the systems to be locally symmetric. 
We will then examine the consequences of the spontaneous breaking of 
symmetry on the particle spectrum of the system. Let’s make a start! 


Abdus Salam (1926-1996) is notable, 
not only for his many achievements in 
quantum field theory, but for a tireless 
advocacy for the development of science 
in the third world. 

Steven Weinberg’s (1933- ) influence on 
quantum field theory is difficult to over¬ 
state. See his three volume masterwork 
on the subject for a profound and dif¬ 
ferent take on the material. 

2 Although we won’t consider muons 
and tauons, these leptons may also be 
included in addition to the electron. 
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3 

Strictly speaking this should be called 
a U( 2) symmetry. See Penrose for de¬ 
tails. 


4 That this model doesn’t provide an 
explanation for this feature of Nature 
has led to much work over the last few 
decades regarding more general ‘super- 
symmetries’ that may be required to 
describe the Universe more completely. 


Kazuhiko Nishijima (1926-2009). The 
relation was found by Nishijima and 
Tadao Nakano, and independently by 
Murray Gell-Mann. Thus it is often 
also known as the NNG rule. 


47.1 The symmetries of Nature before 
symmetry breaking 

Imagine the Universe a moment after the Big Bang. It is more symmet¬ 
rical than the Universe in which we live. Specifically, the lepton fields 
enjoy an internal SU(2)®U(1) symmetry. 3 In around 10~ 12 seconds the 
Universe will cool below a temperature 10 16 K and undergo a symmetry 
breaking phase transition. Before this occurs we will assess the state of 
the lepton fields of the Universe as described by the Weinberg-Salam 
model. 

Before symmetry breaking neutrinos and electrons are both massless 
particles. Electrons are possible in both left- and right-handed forms, 
but neutrinos only exist in left-handed form. 4 Neutrinos and electrons 
are fermions, and are therefore excitations in Fermi fields. We know 
what to expect from their Lagrangian: it’s a Dirac Lagrangian for each 
held, without any mass terms: 

C = v e (x)i 0u e (x) + e L (x)i0e h (x) + e R (x)i0e R {x), (47.1) 

where we’ve written the right-handed electron held as e R (;r) and so forth. 

The interesting thing about having several helds in a theory is the 
possibility of internal symmetries. As we have seen, symmetries lead to 
conserved charges and one of our tasks in formulating the theory is to 
decide the values of the conserved charges that excitations in each of 
our helds will carry. Below we will assign two mysterious charges to our 
helds, Y and /, which we apparently pluck from the air on the grounds 
that they lead to a successful theory that agrees with all experimental 
observations. It should be noted that these charges are related to the 
more familiar electromagnetic charge Q via the Gell-Mann-Nishijima 
relation: 

Q = h + (47.2) 

The known values of Q for the particles in our Universe will constrain 
the values of I and Y we assign. 

We begin by writing the helds as components of a large column vector: 

/ v e(x) \ 

*(*) = e L (z) . (47.3) 

\ e R (ar) ) 

Next we ask what symmetries this theory has. The first is a local 17(1) 
symmetry. The 17(1) transformation causes the held to pick up a phase 
factor e 1 ^). To ensure invariance we introduce a gauge held B^ix) 
which transforms as B^{x) —» B /Ji (x) + Y^d tl /3(x), where Y is the charge 
of the theory and the parameter g' tells us how strongly particles will 
couple to the hypercharge (just as we write q = Q\e\ and |e| tells us 
how strongly a photon couples to a single electronic charge). Rear¬ 
ranging the position of charge Y slightly in the equations and rescaling 
for future convenience, we’ll rewrite the transformation as esW^U) a nd 
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+ jyd^/3. The former expression tells us that the strength of 
the transformation at a point in spacetime depends on how much charge 
a field carries. 5 Note carefully that the 17(1) charge Y isn’t the electric 
charge; instead, Y is known as the weak hypercharge. We assign the 
fields hypercharges as shown in the table in the margin. Thus under 
17(1) transformations we have 



To summarize so far: we may turn the internal dial labelled ‘17(1) weak 
hypercharge’ without changing the properties of the state of the early 
Universe. The dial serves to multiply the fields by position-dependent 
phase factors, whose effects are cancelled by the gauge field B^fx). 

The internal 17(1) symmetry is not the only local symmetry possessed 
by the early Universe. There is also a local 517(2) symmetry. A 517(2) 
transformation results in the field components acquiring phases whose 
values depend on the details of the conserved charge 6 I , known as weak 
isospin, carried by each field component. The weak isospin I obeys 
the usual rules of spin angular momentum so, for example, the third 
component of isospin for a field with 7=1/2 has eigenvalues I 3 = ±1/2. 
We assign isospin quantum numbers to particles as shown in the table in 
the margin, with the result that the fields undergo the transformation: 




eR —»eR. 


(47.5) 


For 517(2) transformations the gauge field needed to guarantee local 
invariance is (a:), which transforms according to r ■ —> r • W/ + 

It • (d^a) — r • (a x where g is the coupling to the weak isospin. 7 

Summarizing again: the dial labelled ‘517(2) weak isospin’ may be 
turned at will in the early Universe without any physical consequence. 
This dial mixes up the massless fields v e and eL and multiplies them by 
position dependent phases. Its effects are cancelled by the gauge field 
Wni x). The transformation has no effect on eR. 

We have now described a set of local 17(1) and 517(2) transformations 
for our set of field components. Together they make the theory locally 
invariant with respect to the enlarged group of transformations 517(2)0 
17(1). In order to make these symmetries of the Lagrangian we now need 
to replace derivatives d M in the Lagrangian with covariant derivatives. 
We write 


5 

The reason for doing this here is that 
the different components of the field 
are going to carry different amounts of 
charge. We will call Y a 4 C/(1) charge’ 
because it controls the coupling be¬ 
tween the fermions and the gauge fields 
we need to introduce to guarantee local 
U(l) symmetry. 


Field 

Ve 

eL 

eR 

Y 

-l 

-1 

-2 


Table of weak hypercharge Y 


^ See the previous chapter for further 
details of how this arises. 


Field 

V e 

eL 

eR 

I 

1 

2 

1 

2 

0 

h 

1 

2 

1 

2 

0 


Table of isospin quantum 
numbers I 3 and I 


As in the previous chapter, the 
field itself has 7 = 1. Note also that 
this field has Y = 0 units of weak hy¬ 
percharge. 


17(1) : D^ = dS&-yYBp{x)*, . ,, 

517(2) : D^ = d^-iglr-W^x)*. { ’ 

Since the original column vector (i/ e , eL, cr) is distinguished by the first 
two (left-handed) components having the same Y charge and I charge 
while the right-handed components have different ones, we will rewrite 
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8 The logic of this notation is that it 
puts the field with Is = 1 in the upper 
slot of the L doublet and the one with 
Is = — 1 in the lower slot. 


^ Note that the addition of mass terms 
to the Lagrangian in eqn 47.10 would 
violate the local symmetry we have set 
up. As shown in eqn 47.26, a mass term 
links the left- and right-handed parts of 
the electron field via a term —m(eLe R + 
crcl) but, since the 51/(2) transforma¬ 
tion affects the left- and right-handed 
parts of the field differently, this term 
cannot be admitted. 


the fields as 

T = 


L 

R 


where L 


■ 

e-L 


R = e R . 


(47.7) 


This prevents our having to keep writing Is and Vs in the Lagrangian. 8 
In terms of these new fields, the covariant derivatives are written: 


D^L = d^L - TjT ■ W^L + 

D^R = d^R + ig'B^R. 


(47.8) 

(47.9) 


Our resulting, locally symmetric, Lagrangian is given by 

C = Ri^id^ + ig'BJR + Lir^-'-gT-W^+'-g'B^L 

-\gW ■ (47.10) 

where G™ = d„W v - d + gW„ x W v and F™ = d- d v B„, 
which are the correct contributions from the respective gauge fields for 
SU(2) and U( 1) theories. 9 


Example 47.1 

Although apparently plucked from the air, the assignment of charges to the field com¬ 
ponents may be physically motivated as follows. We want our theory of electroweak 
interactions to have both the weak interaction and electromagnetism correctly em¬ 
bedded within it. In terms of left- and right-handed components, we may write the 
weak isospin currents as 

JA 1 = (47.11) 

and the electromagnetic current as 

Jem = Q (eL7 M e L + e R 7 M e R ). (47.12) 

Note that, in terms of net charge transfer, Je m and are neutral currents , whereas 
2 are charged currents. Since does not involve e R , but j£ m does, we need to 
add an extra current to the problem. This should ensure that we have a consistent 
gauge theory that has electromagnetism correctly embedded within it. The simplest 
solution is to write 

■*&. = ■# + 2 J Y> ( 47 - 13 ) 

where Jy is the weak hypercurrent and the factor 1/2 is included by convention. 
Equation 47.13 then immediately implies the Gell-Mann-Nishijima relation Q = J 3 + 
T . If we now expand out the expressions for and Jy we obtain 

J i = \ (Pe'y^ v e - e L 7 M e L ) 

Jy = —Vel^Ve - eLT^eL - 2e R 7 ,i eR, (47.14) 

where we have taken Q = — 1 for the electron field in the final line. The values of 
Q,l 3 and Y may then be read off from the coefficients of the / 77 /f Ay, ei, 7 /f ei J and 
terms in the expressions for Jem. .//( and ./(., in agreement with those listed 
in the tables above. 


So far this seems merely an elaborate example of writing down a La¬ 
grangian with some potentially interesting local symmetries. The amaz¬ 
ing things start happening when we add a further field into the mix. It 
is this field which, from behind the scenes, will determine the properties 
of our electroweak Universe. 



47.2 Introducing the Higgs field 437 


47.2 Introducing the Higgs field 


We now introduce a massive, complex scalar field called the Higgs field 
into the Universe. It is the interaction of this field with eL and which 
will make electrons massive particles. The Higgs field has four compo¬ 
nents which are conveniently arranged into a two-component vector 10 as 
follows: 


_ 1 ( ft 3 + i</>4 

y/2 V 01 + i</>2 


(47.15) 


This arrangement works since we have 


ft* 1 ft — (< i> + )*ft + + {ft 0 )* ft 0 — + ftl + ft\ + ft\), (47.16) 


giving something that looks like a magnitude. 

We need to know how the Higgs field transforms under U (1) and SU (2) 
transformations. The Higgs field is defined to have weak hypercharge 
Y = +1 and weak isospin 7 = 1/2. It therefore transforms according to 



(47.17) 


This choice of charges requires a covariant derivative for the Higgs field 
of the form 

D„ft = d^ft - l -gr ■ W^ft - '-g'B.ft. (47.18) 

The Higgs field will give a contribution to the Lagrangian of 


£* = + ^cftft - \{ftftf. (47.19) 


The positive mass term gives rise to the potential shown in Fig. 47.1, 
from which we see that we have set the Higgs field up for a fall: it’s 
unstable to symmetry breaking. Recall that an ordinary scalar field, 
with negative mass term, has a minimum in potential at ft{x) = 0 and so, 
classically at least, it takes the value zero in the vacuum. However, with 
its Mexican hat potential (Fig. 47.1), the ground state of the Higgs field 
is at (<ft)o ^ 0, implying that the ground state of a Universe containing 
the broken symmetry Higgs field, will be permeated by the uniform field 
{(ft ) 0 . As we shall see, it is the fact that electrons are moving through 
(and interacting with) this ether that gives them mass. 

Of course, all things related to the Higgs field would be irrelevant to 
our discussion of electrons and neutrinos if we didn’t have an interaction 
between the Higgs field and the electron/neutrino field. This is given by 


10 Again the upper slot contains the 
fields with 1 3 = 1, while the I 3 = — 1 
fields sit in the lower one. 


U{(ft) 





Fig. 47.1 The potential of the Higgs 
field. 


Ci = -GftLftR + RftL), 


(47.20) 
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where G e is the coupling strength. Putting everything together, the 
Lagrangian for the Weinberg-Salam model is given by 

C = Li^D^L + Ri^D^R + iD^YiD^) 

+ T ^<j>U - - G e (L<t>R + RtfL) 

-\g^> ■ - l -F^F^ v , (47.21) 

where the covariant derivatives are those appropriate for each field, as 
discussed above. To be clear, the massless fields described by this La¬ 
grangian are the left- and right-handed electron spinor fields, the left- 
handed neutrino spinor field, the massless gauge held and the mass¬ 
less B M gauge held. 

The Higgs held is unstable to symmetry breaking. What will hap¬ 
pen? After 10 _12 s have elapsed the ground state of the Universe will 
break symmetry, with the Higgs held choosing a unique state from the 
infinity of possible ones in the gutter of the potential in Fig. 47.1. If 
the Weinberg-Salam theory describes reality then, as this happens, the 
electron should take on a mass, the neutrino should remain massless and 
the massless photon of electromagnetism should emerge. This will turn 
out to be how things fall and, as a bonus, the existence of two more 
massive particles, the and Z° bosons, will be predicted. 


47.3 Symmetry breaking the Higgs field 


11 Since we have the non-abelian sym¬ 
metry SU( 2) then this will bear a re¬ 
semblance to the case examined in the 
last chapter. 


It is now a time t > 10 -12 s after the Big Bang, the Universe has cooled 
below 10 16 K and its ground state breaks the SU(2) (g> 17(1) symmetry 
we have been discussing. Just as in Chapters 26 and 46 we’re going to 
examine the consequences of this broken symmetry. 11 The minimum in 
potential of the Higgs field described in eqn 47.19 is not at (</>)o = 0, 
but at 4 >)o = v = . We’ll choose to break the symmetry with a 

new ground state at 


Oi)o - (^2)0 — 0 > (<^>3)0 — 0 ) (^4)0 — 0 - ( 47 . 22 ) 


Writing (</>i)o = * = we then have a g roun d state field 

<«° = ( / ) 0 = ( “ ) • (47 - 23) 

An important point here is that when we break a non-abelian symme¬ 
try, our choice of vacuum may be invariant under a subset of the original 
symmetry transformations, meaning that those symmetries aren’t bro¬ 
ken at all. The vacuum chosen here has the property that it is still 
invariant with respect to the local transformation U = e ‘(^ +l3T )“( x ) 
which, using the Gell-Mann-Nishijima relation of eqn 47.2, we see is 
equivalent to the 17(1) transformation e 1< 7“ ( 7). Invariance with respect 
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to this transformation gives our broken symmetry Universe the gift of 
Maxwell’s electromagnetism. 

We could search for excitations from this ground state field in a very 
general way by allowing every component to vary. 12 However, just as in 
the last chapter, we are at liberty to perform a different gauge transfor¬ 
mation at each point in spacetime, and thus reduce the excited field to 
the much simpler unitary gauge form 


h(x ) 

~V2 


(47.24) 


We are now ready to examine the consequences of spontaneous sym¬ 
metry breaking for this theory. We will demonstrate that it predicts 13 
the existence of (i) massive electrons and massless neutrinos; (ii) mass¬ 
less photons; and (iii) massive W ± and Z° particles. 


47.4 The origin of electron mass 


The physics of the Weinberg-Salam model is revealed by picking out the 
choice parts of the Lagrangian. Firstly we examine where we expect to 
find a fermion mass in a Lagrangian theory written in terms of left- and 
right-handed fields. If we start with the Dirac Lagrangian C = ippjp—m)ip 
and write ip — + ip R , we have 

C = Ip^flph + V’L^'i/’R + V’R^V’L + V’R^/’R 

-ijj L mip L - ip-Lmip R - ipRinipi, - ip R mip R . (47.25) 

In fact a number of these terms cancel. 14 We are left with 


C = iphflipL + ipnpi’n ~ m(ip h ip R + ip R ipi,). (47.26) 

This provides us with a clue for where to search for the masses of the 
electrons in the Weinberg-Salam Lagrangian: we are looking for a term 
containing the combination (ipi,ip R + ip R ipi,), which we expect to be mul¬ 
tiplied by a scalar quantity, which will be the electron mass for which 
we’re searching. Just such a term is to be found in the interaction be¬ 
tween the Higgs field and the lepton field, given by 

£ I = -G e (L<f>R + R<ftL). (47.27) 


We will insert our broken symmetry excited state (eqn 47.24) into this 
equation and see what happens. The first term yields 

) e R . (47.28) 


L (j)R = eL ( v + 


h(x' 

71 


Notice that the neutrino field has dropped out. This is good news since 
it implies that it will remain massless, just as we hope for a realistic 
theory. The second term gives 



eL- 


“So. for example, we could write our 
excited state field as 


4>{ x ) = 


75 [ 4 > 3 ( x ) +i04(x)] 

v+ ^2 ['/'l ( x ) +i02(x)] 


13 

Note that the masses of both the pro¬ 
ton and the neutron (which contribute 
most to the masses of everyday objects) 
are dominated by the confinement en¬ 
ergy of quarks inside them. Thus the 
popular notion of the Higgs field as 
some kind of unique giver of mass to ev¬ 
erything in the Universe is rather wide 
of the mark. 


^To show this we use the fact that the 
left- and right-handed parts are projec¬ 
tions of the full field 'ip, obtained via 




We commute the operators so that the 
projection parts sit together. We find 
that terms containing the momentum 
and mixed right- and left-handed fields 
always contain the combination 


(l-7 5 )(l+7 5 ) = l-(7 5 ) 2 


and since ( 7 s ) 2 = 1 we conclude that 
all mixed terms involving momentum 
cancel. Totally left-handed or right- 
handed terms involving the mass terms 
also contain this combination, so we 
lose those too. 


RppL = e R 


(47.29) 
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Fig. 47.2 The interaction of the left- 
and right-handed electron fields and the 
Higgs field from eqn 47.27. 


The neutrino field doesn’t feature here either and is destined to be mass¬ 
less. Putting the two halves together we obtain 

e L ). (47.30) 

The first term is in the predicted form of a scalar mass multiplied by 
4’h'ipR + V'rV’l- It predicts the electron mass to be m e = G e v. 

We conclude that, as claimed, the broken symmetry Higgs field, which 
takes the value \[2v throughout the vacuum, has provided a source of 
quantum mechanical treacle that causes the electrons to acquire mass. 
The nature of the interaction between the Higgs field and the left- and 
right-handed electrons is represented in Fig. 47.2, which depicts the in¬ 
teraction in eqn 47.27. 


Ant = -G e (e L ve R + e R i>e L ) - G e 


h(x) 

eL 7T R 


e R - 


h(x] 

71 


47.5 The photon and the gauge bosons 

Next we need to show that the theory predicts a massless photon. We’ll 
do this by looking at the place in the Lagrangian where the gauge fields 
interact with the Higgs field. Minimal coupling tells us that this is the 
term involving the covariant derivative of the Higgs field: {D^efty (D^ej>). 


Example 47.2 

Inserting the broken symmetry version of the Higgs field </>, the covariant derivative 
D^cf) becomes 


^d^h(x) 

Multiplying out, we obtain 


W 3 

wl Aw* 


D I_i4> = -- 


W£ - i 
-wl 


gv(Wl - i wl) + a -^(Wl - ml) 


w 


B u 


h(x) 

vT 


2 ^ i^d^ + vi-gWl+g'B^ + ^GgWl+g’B^ 
Finally we multiply this by its adjoint to form the term in the Lagrangian 


l r 


(D # ,flT(£>'*0) = -[dy.h(x)Y + 


9 ^~ (wl) 2 




(47.31) 

(47.32) 


+ — [(gWl — g'Bn) 2 ] + (higher order terms). (47.33) 


15 r 

Recall from Chapter 13 that such a 
linear combination of fields gives rise to 
a perfectly valid description of particles 
for a theory. 


Noting, as usual, that a boson mass always enters the Lagrangian in 
the form 4 (mass) 2 x (field) , we see that in breaking the symmetry of 
the Higgs field we have developed massive IT)) and W'l vector particles, 
with masses M^ = g 2 v 2 /2. The linear combination 15 of gauge fields 
(gWl — g'B^) has also grown massive, but this component has a mass 
that depends on the relative sizes of the coupling constants g and g'. 

Notice that three Goldstone modes of the Higgs field have been con¬ 
sumed with the result that three gauge fields have grown massive. The 
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excitations of these three fields will become the W + , the W~ and the Z° 
vector bosons in the next step. We therefore have three massive vector 
particles. Where’s the massless photon? To find it, we note the absence 
of the orthogonal linear combination (g'W^ + gB M ) and propose that 
this combination of fields is (proportional to) A^x), the photon field of 
electromagnetism. 

Since our particles depend, in a non-trivial manner, on the ratio of 
g and g ', we may simplify things by drawing the right-angled triangle 


We then define two new fields Z/ and A^ by 


16 


A„ 


M _ 


W?. 


B„ 


COS 0\y ~ sin 0\y 

sinff-yv cos6\y, 

where, as suggested by the notation, A M is to be interprete 
friend, the electromagnetic field. In terms of the new fields, 
equation becomes 



: lengths 

Fig. 47.3 Definition of the Weinberg 

= 9 '/g- 

angle #w in terms of g' and g. 



1 The fields may be written 
lently as 

equiva- 

(47.35) 

(sW3 - g'BA 

— 1 5 


our old 

{g 2 + g’ 2 ) 2 


original 

„ (g'W/+gBA 

(47.34) 

{g 2 + g' 2 ) 2 




\\d,h(x)f + (Wlf + (Wlf 
(p‘ r \p‘ 

4- 7 — — Z~ + (higher order terms), (47.36) 

4 cos^ 0 w M 


and from this equation we can read off 17 that the mass of the Z° particle 

is Mw/cos#w- 

The fact that the W tl and Z M fields have taken on a mass means that 
their propagators will have the form p 2 ^M 2 + - ie ■ Recall from our dis¬ 
cussion of the Yukawa force in Chapter 17 that this implies a potential 
varying as U(r) oc e _M l r L We conclude that, compared to the elec¬ 
tromagnetic interaction which is mediated by the massless photon, the 
weak interaction is short-ranged , falling off over a distance 1/M. 

We now have the ingredients for the non-interacting Lagrangian of 
the broken symmetry Universe (see margin on page 443). However, the 
only way to observe the predictions of the theory is via the interactions 
that the particles enter into with each other. Our final task is therefore 
to substitute our results back to see what the theory predicts for these 
interactions. 


Example 47.3 

As in the case of QED, minimal coupling allows us to see the interactions by consid¬ 
ering the covariant derivatives. Starting with 

ii? 7 M (d M + i g'B^R + i(<9 M - '-g-r ■ W„ + ^g'B^ U (47.37) 

and expanding out and using our expressions for AZ and Weinberg’s trigonom¬ 
etry, we may eliminate BWj* and g' and we obtain 
ie^d^e + ~ g sin 6 w e^eA^ 

H-^ (sin 2 6 > w e R 7 ' i eR - i cos 20 w e L 7 , 'e L + \v e 7^v e )Z [l 

cos u\\i 2 2 



17 

The mass of the W boson is given by 
= g 2 v 2 / 2. The term premultiply- 
ing Z/ in eqn 47.36 is 4 ( £f 9w = \Ml 
and hence Mz = Mw / cos • 



Fig. 47.4 The electroweak interaction 
vertices predicted by the Weinberg— 
Salam theory. 


(47.38) 
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where e = eL + eR, likewise for v and we define the field Wp = {Wp + iWp)/y/2. We 
expect the quanta of the Wp field to be two species of oppositely charged, massive 
vector particles: the W+ (the quanta created by the Wp (x) -field, with I 3 = 1) and 
the W~ (annihilated by Wp(x), with I 3 = —1). The Gell-Mann-Nishijima formula 
(eqn 47.2) tells us that W+ has Q = 1 units of electric charge, while the W~ has 
Q = — 1 units. In contrast the Zp{x) field is uncharged and we expect a single species 
of massive quantum Z Q . 


For a more complete description of 
the 18 interactions described by the 
theory see Mandl and Shaw, Chap¬ 
ter 19. 



Fig. 47.5 Some example electroweak 
processes, (a) -\-u e —> Vp + e ~; (b) 

positive pion decay involving quarks u 
and d decaying into /i+ + Vp . 


Historically, a value for the Weinberg 
angle had been estimated from ear¬ 
lier measurements of lepton scattering 
cross-sections and provided a predic¬ 
tion for the masses of the W± and Z° 
particles which were subsequently dis¬ 
covered at the predicted energies. See 
Gary Taubes’ highly recommended No¬ 
bel Dreams for the frightening history. 

20 

Often incorrectly pronounced 
‘Boatswain’ by the UK media, possibly 
as some kind of nautical throwback. 


In eqn 47.38 we have an expression for some of the interactions 18 in the 
electroweak theory in terms of the coupling g and the Weinberg angle 9\ y. 
Some important points to note from our result is that A M only couples to 
electrons and not to neutrinos, just as experiment shows the photon does. 
The vector field If), only couples left-handed electrons to neutrinos and 
doesn’t touch right-handed ones. The vector field Z M couples to left- 
and right-handed electrons with different strengths. Notice also that 
that parity operation P _1 7UP, which exchanges left- and right-handed 
fields, is not a symmetry of those parts of the interaction involving the 
W^(x) and Z^{x) fields. This is the sense in which, famously, the weak 
interaction is said to ‘violate parity’. 

The vertices for these interaction are shown in Fig. 47.4. The most 
startling of these involve the W lt field and show that the emission or ab¬ 
sorption of a W ± particle can result in the transformation of an electron 
into a neutrino (and vice versa). Some examples of particle processes 
allowed by the weak interaction are shown in Fig. 47.5. 

Note what the theory predicts for the electric charge. In QED the 
charge is the coupling constant in front of the fermion-photon interaction 
term: — Taking q — \e\ for a single electronic charge we read 

off from eqn 47.38 that the electric charge is given by 

|e|=<jisin0w (47.39) 

The Weinberg-Salam electroweak theory and its verification are two 
of the greatest achievements of twentieth century science. The Z° and 
W ± particles were detected with masses Mz = 91.19 GeV and = 
80.40 GeV and correspond to a Weinberg angle (via cos#w = Myj/Mz) 
of ~ 28°. 19 The final prediction of the theory is the existence of a 
scalar field excitation in the Higgs field, known as the Higgs boson. 20 
At the time of writing, experiments at the Large Hadron Collider have 
found a particle, thought to be the Higgs, with a mass « 125 GeV. 

Thus in summary our current picture is of a symmetry-broken Higgs 
field that permeates the Universe. Other fields are set to zero in the 
vacuum and fluctuate around that, giving rise to particle—antiparticle 
excitations that wink in and out of existence; the Higgs field is nonzero 
by default. With no Higgs field, the electron and electron neutrino would 
be identical particles, and the W and Z particles, and in fact all standard 
model fermions, would be massless. That indeed was the situation in 
the very early Universe before the Higgs field became symmetry broken. 
But immediately after the electroweak phase transition (that occurred 
perhaps 10~ 12 s after the Big Bang) the present broken-symmetry state 
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emerged and the masses of these particles became simply a function 
of how strongly they couple to the Higgs field. And couple they will, 
because the Higgs field is both nonzero and everywhere. 


Chapter summary 


• In the period t < 10 12 s after the Big Bang the leptons of the early 
Universe enjoy an internal SU( 2) ® 17(1) local symmetry. 

• The Weinberg-Salam electroweak theory describes the breaking of 
the symmetry of the Higgs field with the result that a nonzero 
Higgs field permeates the vacuum, providing electrons with mass. 

• This also results in the emergence of the photon and three-vector 
particles W + , W~ and Z°. The excitations of the electromagnetic 
field An do not acquire a mass, whereas the W ± and Z° particles 
become massive. 


Non-interacting electroweak La- 
grangian 

Co = e(i0 - m e )e + v e (ip)ve 

+ \(de h f - \ m l h2 

where = dpWp — d v W ^ and sim¬ 
ilarly for Zpv. 


Exercises 


(47.1) Show that the broken symmetry ground state 

vacuum expectation value ^ ^ ^ is in¬ 

variant with respect to the transformation U = 

e i (T+ I 3T 3 )a(x) ' 

(47.2) Verify eqn 47.26. 

(47.3) By considering the covariant derivative of the Higgs 
field, verify eqn 47.33. 

(47.4) Verify eqn 47.38. 

(47.5) Fermi’s theory of the weak interaction. 

(a) Read off the interaction vertices for weak in¬ 
teractions between electrons and neutrinos from 
eqn 47.38. 

(b) Draw a Feynman diagram for negative muon 
decay (i~ —> e~ + P e + and show that, at low 
momentum, the process is described by an interac¬ 


tion Hamiltonian 


2 

rr 9 - a 9<*P _ /5 

Hl = 


(47.40) 


(c) Defining the current for the lepton species l as 
J“ = 17“(1 — 75 )vi, show that the interaction for 
the muon decay can be recast as 


Hi = 


(47.41) 


where G = 


This is known as the Fermi 


current-current interaction. 

(d) For muon decay G « 10~ 5 mp 2 , where m p is 
the proton mass. Comment on the ‘weakness’ of 
the weak interaction. 
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MAJORANA 


MAJORANA 

FERMION 


FERMION 

(particle) 


(antiparticle) 


Tragically I was an only twin 

Peter Cook (1937-1995), title of an unrealized project. 

Can a fermion particle be its own antiparticle? That is to say, if we 
take the charge conjugate of the creation operator of a Fermi particle 
cj,, can it return the same operator C^c^C = 4 without creating an 
inconsistency in the theory? We know that particle excitations of the 
real scalar held are identical to antiparticle excitations of the scalar held. 
We also know that the photon excitations in the electromagnetic held 
are identical to antiphotons. Scalar particles (spin-0), photons (spin-1) 
and gravitons (spin-2) can be described by real helds: <f> = ft. Since 
creates a particle and (j> an antiparticle, we expect these particles 
to be identical to their antiparticles. The Dirac theory of the electron 
(Chapter 36) seems to require complex helds (though see below), and so 
it might seem at first glance as if fermion helds necessarily involve dis¬ 
tinct particle and antiparticle parts. This would explain why electrons 
and positrons (anti-electrons) are distinct particles, and also why even 
the electrically-neutral neutron is not identical to its antiparticle, the 
antineutron. However, this is not the whole story, and in this chapter 
we examine the steps required to formulate a theory containing fermions 
whose particles and antiparticles are identical or, if you prefer, symmet¬ 
ric. 


Fig. 48.1 A majorana fermion is its 
own antiparticle. 


48.1 


The Majorana solution 


Etorre Majorana (1906-[1938+rr]) had 
a short, but brilliant, career in theoret¬ 
ical physics which was cut short when 
he disappeared whilst travelling by pas¬ 
senger ship from Palermo to Naples. It 
has been variously suggested that he 
committed suicide, was murdered, or 
escaped to Argentina. 


Etorre Majorana came up with a theory in which a fermion particle was 
its own antiparticle (see Fig. 48.1). Such particles are called Majorana 
fermions and are unchanged by the act of charge conjugation. 


Example 48.1 

For charged particles we know that the charge conjugation operator swaps particles 
for antiparticles, so for a complex scalar field we have, for example: C —1 apC = bp. 

Let’s start by taking a naive view and examining a mode expansion of a classical 
scalar field (that is, the expansion of a function; not an operator): 



d 3 p 1 
(2?r)t (2E p )i 


(ape- 1 *”® + a; e * *) . 


(48.1) 
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Notice that the second term is the complex conjugate of the first and that (j){x) is 
real. Compare this with the case of a (classical) complex scalar field: 

i>{x) = [ -— r (a p e~ ipx + b*e ipx ) . (48.2) 

J (27T)i (2E p )2 V P > 

Here the complex conjugate of the first term yields something which is not identical 
to the second term, i.e. the particle is different from the antiparticle. 

Speaking very roughly, charge conjugation involves swapping each field for its 
complex conjugate and so a real field like <f>(x) will describe particles that are their 
own antiparticles. This will not be the case for the complex field U>(x). 


Motivated by the previous example, we will search for a real (rather 
than complex) solution to the Dirac equation. Such a solution follows 
if the Dirac equation itself is real. Whether the Dirac equation (10 — 
m)i> = 0 is real or complex depends on the choice of the 7 matrices. 
Recall that there are many possible 7 s out there, constrained only by 
the requirement that { 7 At , 7 I/ } = 2 . If we can find a set of 7 s which 
are purely imaginary, then the Dirac equation itself will be real and 
consequently so will its solutions. 

Majorana found such a set of purely imaginary 7 matrices, given in 
eqn 48.3 in the margin. Using the Dirac equation given by these matrices 
we will find solutions v(x) which will have the property that v(x) = 
v*{x), which is known as the Majorana condition and reflects the 
fact that the solutions are identical to their complex, or more precisely, 
to their charge conjugates. 

Having demonstrated the possibility of Majorana fermions existing as 
viable quantum mechanical entities, we would now like to make contact 
with our previous approach and describe Majorana’s solution in the 
chiral representation, which involves stacking up pairs of two-component 
Weyl spinors to make four-component Dirac, or in this case Majorana, 
spinors. We therefore need to consider how charge conjugation works for 
Weyl spinors. (Note that we’re treating our spinors as c-number wave 
functions for the moment; fields will follow shortly .) 1 


Example 48.2 

There is a recipe for obtaining the charge conjugate \Pc(x) of a Dirac spinor Tfx) in 
chiral representation. It is 


Majorana’s 7 matrices: 



(48.3) 


1 Note also that, throughout the rest 
of this chapter, we reserve the sym¬ 
bol 7 for two-component Weyl spinors 
only, which may be left-handed (7 l) or 
right-handed (7 r), while the symbols 
'I' and v denote four-component Dirac 
and Majorana spinors respectively. 


3> c (a0 = C-^OUC = Cq'T'Ot). 


(48.4) 


with Co = — i 7 2 . As an example, we could start with a Dirac spinor T = 

containing only a left-handed Weyl spinor and take the charge conjugate as fol¬ 
lows: 







0 

i 


(48.5) 


This gives a component in the lower slot involving the original left-handed Weyl 
spinor . 2 You may verify that conjugating the charge twice returns the original spinor. 


2 


Note that charge conjugation of the 


right-handed spinor gives 

( 


0 

bR 
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Here’s the solution: in the chiral basis a Majorana spinor may be built 
out of a left-handed Weyl spinor and its charge conjugate as follows: 


v{x) = 


0 


0 


ia^Kx) 


, (48.6) 


that is v(x) = 


, where we write the charge conjugate of a 


^We could also have started with a 
right-handed spinor and then ^built a 

Majorana fermion i/ = ^ 


V’R 

The point is that we only need to 
start with either a left-handed spinor 
or right-handed spinor. 


V’l(z) 
iph,c(x) 

Weyl spinor as i/'l.c = io' 2 V , L' This solution v(x) obeys the all-important 
property that vc(x) = v(x), which is a more general expression of the 
Majorana condition above. Notice that a Majorana particle may be 
built starting with only a left-handed Weyl spinor, putting its conjugate 
part into the slot where the right-handed part usually lives. 3 This is 
different to the ordinary Dirac fermion we met in Chapter 36, which has 
independent left- and right-handed parts. 

48.2 Field operators 


So far we’ve been working with wave functions. To make the the¬ 
ory respectable we should write things in terms of field operators. 
Like the wave functions, these fields must obey the Majorana con¬ 
dition, which expressed in terms of charge conjugation operators is 
v(x) = vc(x) = Cqv*{x). A slightly jarring abuse of notation here 
is the complex conjugate of the field operator, since the complex con¬ 
jugate of a creation or annihilation operator is not, strictly speaking, 
defined. In taking the complex conjugate we are seeking to avoid the 
matrix transpose that goes along with Hermitian conjugation. There¬ 
fore a fussier, but more correct, way of writing the conjugation operation 
would be z>t T However, we’ll stick with the complex conjugate notation 

/ u s *(p) \ 

where one should note that u s *(p) = ( if*\p) J an< ^ we ^ ree ^ abuse 
the notation with the understanding that here a* = aj,. 


^Consider, for example, the 

charge conjugation of the posi¬ 
tive helicity antiparticle spinor 
from Chapter 36: — i , y 2 v s *(p) = 

' \ 

VE + Jp\ 

o 

V -^E^\p\ 

f \ 

= M =MS(P) ’ 

V o / 

i.e. a positive helicity particle spinor, 
as claimed. 



Example 48.3 

For a Dirac particle, recall that the field annihilation operator is given by 

*(*)= [ (48.7) 

J (2tt)§ (2E p )5 V V ’ 

whose charge conjugate is C~ = Co^/*(x) 

= [ (-n 2 u s *(p)aip^ ip x - h 2 v s *(p)bspe-*-*) . (48.8) 

j (27T)i (2E p )2 V V ’ 

Moreover, using the explicit forms of u s (p) and v s (p) it may be shown 4 that 
—i 7 2 u s *(p) = v s (p) and —vy 2 v s *(p) = u s (p) leading to a charge conjugate field 

C-^WC = [ TT- — l — J2 (v s (p)aipj pX + uHpybspe-^*) . (48.9) 

j (27T)§ (2E p )2 V V J 

For what follows, note that this can be made the same as the original Dirac field if we 
were to make the replacements a\ p —> b\ p and b sp —> a sp , from which we conclude 
that the prescription indeed enacts charge conjugation on the Dirac field in that it 
returns a Dirac field with particle operators exchanged for antiparticle operators. 
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Finally we will write down a Majorana field. We start by noting that 
the general form of a quantum field is 


(Quantum Field) = (Particle part) + (Antiparticle part) 

= (Particle part) + CA 1 (Particle part)C. 


If particles and antiparticles are identical (which is what we’re after) then 
C _ 1 aJ p C = a\ p and so C~ 1 u s (p)a\ p C = — i'y 2 u s *(p)a\ p . To construct the 
Majorana field we therefore replace the antiparticle part of the Dirac field 
v s (p)bl p with —iy 2 u s *(p)al p . The result is the Majorana field 


H x ) 


d 3 p 1 

(27T)§ (2 E p )l 


^2(u s (p)a sp e lp 


i 7 V*(p)at p e^) . 


(48.10) 

As may be checked, the Majorana held does indeed enjoy the Majorana 
condition: 


CA^aOC 


d 3 p 1 
( 2 tt )§ ( 2 E p )h 


£ (-iy 2 u s *(p)al p e ip - 


= z>( x). 


u s (p)a sp e~ ipx ) 

(48.11) 


Example 48.4 

If we start with a left-handed (momentum space) Weyl spinor u£(p) the Majorana 
field takes the form 


>(*) = / 


d 3 p 


1 


(2tt)2 (2E p )2 


E 


«£(p) 

o 


o 

i <? 2 ut*(p) J ^ 


(48.12) 

As in the case of wave functions, we see that Majorana fields may be built out of 
Weyl spinors of only one handedness. 


Although the argument of this chapter has been very formal, the point is 
a simple one: it is indeed quite possible to build a Majorana held whose 
excitations are Majorana fermions. We will now discuss the physics of 
these particles. 


48.3 Majorana mass and charge 


First we consider the mass of the particle excitations of the Majorana 
held. Recall that Weyl spinors are necessarily massless. Massive Dirac 
spinors must contain independent left- and right-handed parts and the 
particles may be thought of as oscillating between the two. The mass 
term in the Dirac Lagrangian may be written in terms of Weyl spinors 


mo^L^R + *r*l), 


5 

See the previous chapter for an expla¬ 
nation of this. 


(48.13) 
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^Notice that we don’t have terms 
tf L 4> L and 'Lr'I'r which are not 
Lorentz invariant and therefore are for¬ 
bidden. 


7 

Note that the N objects in this equa¬ 
tion are built from four-component Ma¬ 
jorana spinors v, so themselves have 
eight components. 


where we write V&l = ^ J and ’Lr = ^ ^ J . Vitally, eqn 48.13 is 
a Lorentz invariant quantity. 

As Majorana solutions may be written in terms of Weyl spinors of 
a single chirality we might question whether it is possible to identify a 
massive Majorana field. This worry looks to be valid since, as shown 
in the exercises, a mass term in the Lagrangian of the form v(x)v(x) 
vanishes if we treat the fields in the Lagrangian as c-numbers, as we do 
for the Dirac case. However, it turns out that we may define massive 
Majorana fields, as long as they anticommute. In that case we may 
have massive Majorana fields built out of left-handed Weyl spinors only, 
with mass tol, or Majorana fields built from right-handed Weyl fields 
only, with mass tor. Turning to the Lagrangian, we may write the most 
general, Lorentz invariant mass term for Majorana fields as 7 

(Vl,cMAT l + iY L MAT L ,c) , (48.14) 


where 


N T, = 


ZA 

Z'R.C 


and where 
/ —icr 2 ip 


0 


4>l 

0 


ATl.c = 


*A,C = 


" L ’ c ), 

M = 


VR J 


v in D 

0 > 

i 

( o 

1 J 

),v K = 

V V’R 


to d 


(48.15) 
and ^ R ,c = 


. Here we have three types of mass: the Dirac mass tod 


along with the right- and left-handed Majorana masses to r and Tol- 


Example 48.5 

Working with Majorana fields v(x) built from left-handed Weyl spinors, we set mo = 
ttir = 0 and m l / 0. We may then write a Majorana Lagrangian as 

C = if/i fiv- i(i>L,C^L^L + ^L m L zy L,c)> (48.16) 

where v = zv L + ^l,C- Varying this Lagrangian with respect to leads to the 
Majorana equation for the field 0given by 

io-'^d^V’L - mLiiT 2 ^£ = 0, (48.17) 

which may be recast in the form of a Dirac equation 

(i$ — mi_,)v(x) = 0. (48.18) 


To complete this cavalcade of formalism, we briefly turn to the electro¬ 
magnetic charge of the Majorana field: it has none. This is, of course, 
necessary for the particle excitations of this field to be identical to the 
antiparticles. It can also be seen from the field equations by noting that 
if v = z'l+z'l.C) then if we make the transformation —> v\^e la , we must 
have vi,'C zA.ce -10 (because v\^ and i/l,c are related by complex con¬ 
jugation). It is therefore impossible to define a (7(1) transformation for 
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Majorana spinors which simultaneously provides both upper and lower 
slots with the same phase factor e la . The Majorana equation cannot, 
therefore, be made invariant under local 17(1) transformations. Put the 
other way: a particle carrying the conserved 17(1) charge of electromag¬ 
netism cannot be a Majorana particle. 

Having formulated this theory we now ask whether it’s of any use. 
Indeed, for many years it seemed to be an interesting solution in need of 
a problem. However, in recent years, several such problems have arisen. 

• The vast number of emergent quasiparticles in condensed matter 
physics present us with an ideal playground for searching for ex¬ 
otic excitations such as Majorana fermions. In a semiconductor or 
a metal, electrons and holes look different because they are oppo¬ 
sitely charged, and so it does not seem possible that the particles 
(electrons) and antiparticles (holes) can be symmetrically related. 
A superconductor, on the other hand, blurs the distinction between 
electrons and holes and so seems like a possible environment for 
realizing Majorana fermions. One of the great things about su¬ 
perconductors is that they screen electric and magnetic fields, and 
so charge is not a good observable. The bogolon excitations of a 
superconductor superpose electrons and holes and, under rather 
special circumstances, it is possible that the quasiparticles of a 
superconducting system are Majorana fermions. An example of 
such a circumstance involves a superconductor in the presence of 
vortices, which changes the equations of motion of the electrons 
and can lead to the trapping of electron hole pairs which can be 
described as Majorana fermions. 

• At first sight neutrinos seem to be well described as solutions to 
Weyl’s equation. All neutrinos are left-handed massless particles 
with negative helicity whereas all antineutrinos are left-handed 
massless particles with positive helicity. However, the discovery 
that neutrinos emitted from the Sun with one flavour may be de¬ 
tected with a different flavour suggests that these particles actually 
possess a nonzero, but small, mass. Whether this is a Dirac mass 
is an open question and there exists the possibility that neutrinos 
are actually Majorana particles with Majorana mass. 

• Consider the possibility that for every species of boson in the Uni¬ 
verse there exists a corresponding species of fermion (and vice 
versa) with the same mass. This would require the Lagrangian 
describing our Universe to exhibit a symmetry known as super- 
symmetry. In a supersymmetric Universe we should expect the 
existence of the ‘selectron’: a spin-0 particle with the mass of an 
electron; and the ‘photino’: a spin-1/2 massless particle. If the 
photino mirrors the properties of the photon then it must be its 
own antiparticle. This implies that the photino is a Majorana 
fermion, as will be the ‘Higgsino’ and various types of ‘gaugino’. 
(Supersymmetry is a subject replete with wonderful terms, and 
happily the supersymmetric partner of the W boson is called a 



450 Majorana fermions 


‘wino’.) Despite much effort, any direct evidence for supersym¬ 
metric particles has so far proved to be elusive. 


Chapter summary 

• A Majorana fermion is its own antiparticle and is a solution to a 
version of the Dirac equation in which the 7 -nratrices are purely 
imaginary. 

• We have shown how to build Majorana fields from Weyl spinors of 
a single handedness and we have written down a Lagrangian that 
leads to their equation of motion. 


Exercises 


(48.1) Show that two operations of the charge conjuga¬ 
tion operator used in eqn 48.5 return the original 
particle. 

(48.2) Show that — ry 2 u s *(p) = V s (p) for a negative helic- 
ity particle spinor. 

(48.3) (a) Show that, for a Majorana held v built from 
left-handed Weyl spinors: 

vv = — i'i/'L o- 2 iph + h.c., (48.19) 


where h.c. denotes the Hermitian conjugate. Ex¬ 
plain why this must vanish if the fields are repre¬ 
sented by c-numbers. 

(b) Show that vv won’t vanish if the fields are rep¬ 
resented by anticommuting Grassmann numbers. 

(48.4) Verify eqns 48.16 and 48.17. 

(48.5) Verify that eqn 48.17 may be recast in the form of 
a Dirac equation as claimed. 




Magnetic monopoles 



One would be surprised if nature had made no use of it. 

Paul Dirac (1902 1984), on magnetic monopoles. 

Do magnetic monopoles exist? We are taught at our Mother’s knee 
that Maxwell’s equation V • B = 0 insists that they do not. However 
it is more the case that they have never been observed rather than 
that their existence is a physical impossibility. In fact there seems to 
be nothing preventing the existence of the magnetic monopole and in 
this chapter we will investigate the properties of these objects. We will 
present descriptions of two rather different sorts of monopole: those 
of Paul Dirac and those of Gerard ’t Hooft and Alexander Polyakov. 
The existence of Dirac’s monopoles are not mandated by any theory. 
However, we’ll see that if we really do live in a Universe that results 
from the sort of non-abelian gauge theories with symmetry breaking 
discussed in previous chapters, then the existence of’t Hooft-Polyakov 
magnetic monopoles is almost inevitable. 


49.1 Dirac’s monopole and the 

Dirac string 451 

49.2 The ’t Hooft—Polyakov 

monopole 453 

Chapter summary 456 

Exercises 456 


49.1 Dirac’s monopole and the Dirac 
string 


The discovery of a certain kind of magnetic monopole with a magnetic 
charge g wouldn’t be a disaster for Maxwell’s electromagnetism. We 
could include them in his equations, which would become rather more 
symmetrical: 


V • E = p e , V • B = p m , 

„ „ T dB „ „ T dE 

Vx£ = -J m -—, VxB = J e + —. 

at at 

The extra terms involve p m , the magnetic charge density, and J m , the 
magnetic current. 1 We would also need to adjust the Lorentz force law 
which would become 


Remember we are using Heaviside 
units. 


1 In conventional electromagnetism 
Pm = 0 and J m = 


F = q(E + v x B) + g (B — v x E) . (49.1) 

This variety of magnetic monopole was first considered by Paul Dirac 
and is therefore known as a Dirac monopole. 

As a first step in our investigation of the properties of these objects, 
let’s recap the physics of electric monopoles, which certainly do exist! 
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Fig. 49.1 Monopoles (electric or mag¬ 
netic) are sources or sinks of field, be¬ 
cause V • E = qS^(r) and V B = 
gS^ 3 \r). 



They are the charges of classical electromagnetism (see Fig. 49.1). For 
an electrical charge q at the origin, Maxwell’s equation yields 

V ■E = qS ( ' 3) (r), (49.2) 


whose solution, due to Monsieur Coulomb, takes the form 

£ =4^F = - V (i^i)- (49 - 3) 

The electric flux $e = J E ■ clS 1 through a spherical surface surrounding 
the electric monopole is then just q. 

If analogous magnetic monopoles exist then their behaviour should be 
governed by similar equations. Instead of the Maxwell equation V • B = 
0 , we should have for a magnetic monopole of charge g at the origin 

V • B = gS {3 \r), (49.4) 


and a radial field equation (that is, the analogue of Coulomb’s law): 


B = 9r 

47T | r* | 3 


(49.5) 


The magnetic flux through a sphere surrounding the magnetic monopole 
is given by <J> M = 5- 


Fig. 49.2 You can think of a monopole 
as originating from an extremely long 
bar magnet. 



Fig. 49.3 The vector field from a 
monopole can be described everywhere 
except along a line of singularities (the 
Dirac string). A N is defined for 0 < n. 
A s is defined for 6 > 0. 


2 

The expression for the curl of a vector 
function only containing an azimuthal 
(e^) component in spherical polars is 


Vx A = 


1 d 
r sin 6 d0 


[A# sin 6 ] e r 


ld_ 

r dr 


[rA^] eg. 


Example 49.1 

The problem with the Dirac monopole comes when you try and write down the 
magnetic vector potential A. There is no single function, without singularities, that 
does the job everywhere; actually, this shouldn’t be a surprise. If you could find such 
a function, then the flux 4 >m out of a closed region S (of volume V) would be 

$ M = j VxA • dS = j V • ('V x A) dV = 0, (49.6) 

B 


whereas this should come out as 4 >m = g. One way of understanding this physically 
is to think of the monopole originating from an infinitely long and very thin solenoid, 
or permanent magnet, which has one end at the origin and the other end very far 
away (somewhere out at infinity). This looks like a monopole at the origin (see 
Fig. 49.2), and the physical Universe can be modelled everywhere, except at a line of 
singularities where the solenoid or permanent magnet lies. To illustrate this problem 
(see Fig. 49.3), consider the function A N given by 


a n = 9 (1-cos 0 ) t 

47 rr sin 6 ^ 


4nr 


■ tan — e.A, 


(6 < 7r). 


(49.7) 


This is well defined everywhere except at 6 = n and taking the curl of it quickly 2 
brings back eqn 49.5. The function blows up at 6 = n for all radii r, and so this 
magnetic vector potential is singular along the whole of the negative 2 :-axis. It’s as 
if the monopole carries around a tail of singularities. This tail is known as a Dirac 
string. Similarly, we can define another function 


A b = — 


g (1 + cos#) , 


—— cot -e.^ (0 > 0), 
4?rr 2 * V ' 


(49.8) 


47 rr sin 6 

which does the same job but has a singularity at 6 = 0, so although we’ve removed 
the tail of singularities along the negative 2 :-axis, we now have them along the positive 
2 :-axis: we’ve only managed to move the problem, not remove it. This illustrates the 
point that no single function will work everywhere. 
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One interesting consequence of these results is that the two magnetic vector po¬ 
tential functions A N and A s only differ by 3 


A N - A s 


_9 _ 

2 irr sin 9 ^ \ 2 ir ) 


(49.9) 


which looks right since A v and A s correspond to the same field B and so they 
should differ only by a gradient of a scalar function. 


'■’Use has been made of the expression 
for grad in spherical polars: 

9/, I <7/ , 1 df . 

Vf — ~^~ er + -~Th e 8 4 - r ~h~al e <l>- 

or r 09 r sin 0 o<p 


The two functions A N and A s represent different choices of electromag¬ 
netic gauge. The wave function if) of an electric charge q surrounding 
the monopole can be written in either gauge, but the two expressions 
will be related by a gauge transformation: 


ip S (r) = ip N (r) exp 




(49.10) 


Thus as we take </> from 0 to 27r, the wave function will only be single¬ 
valued if 


g = n-, (49.11) 

q 

where is an integer. 4 This equation tells us that if the electric charge q 
is quantized, then so is the magnetic charge g. The argument also works 
in the other direction: if we were to discover quantized magnetic charges 
then this would explain the existence of quantized electric charge. This 
is Dirac’s formulation of monopoles. There is no necessity for them to 
exist, but if they are found, they provide a rationale for the quantization 
of electric charge. 
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4 This is called Dirac’s quantization 
condition. 


5 

In fact, taking the Dirac monopole to 
be a point particle is something of a ne¬ 
cessity since, like the electric monopole 
it represents a singular solution to the 
Maxwell equations. 


So far we have been considering single particles: the electrically charged 
particle and, by analogy, the magnetically charged particle. 5 Now we ask 
whether there is a quantum field theory for which a magnetic monopole 
represents a stable solution? There is: magnetic monopoles are topolog¬ 
ically stable solutions of a non-abelian gauge theory with spontaneous 
symmetry breaking. 6 As we will show, these field theory monopoles, pro¬ 
posed in 1974 by ’t Hooft and Polyakov, are rather different entities to 
the Dirac monopoles of the last section. 


6 Since, in Chapter 47, we saw that such 
a theory seems to describe the elec- 
troweak part of Nature, it seems likely 
that magnetic monopoles might be re¬ 
alized in our Universe! There should 
also, we hope, be a good reason why 
we haven’t observed them yet. 


Gerard ’t Hooft (1946- ) 
Alexander Polyakov, (1945- ) 


Example 49.2 

The monopole solution is most easily seen if we review the topological objects of 
Chapter 29. There we started with a Lagrangian which was unstable to sponta¬ 
neous symmetry breaking. Then the topological solutions arose by considering time- 
independent solutions which sat in a different vacuum at different points in space: 

• The simplest was the kink in (1+1) dimensions, which was a field existing in 
different vacuum states at ±oo. 
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• The second simplest was the vortex in (2+1) dimensions, where a field with 
broken U(l) symmetry enjoyed different vacua at all points on a circle at 
spatial infinity. 

Obviously the next step is to examine (3+1) dimensions and a configuration with 
broken SO (3) symmetry which enjoys a different vacuum at all points at a spherical 
shell at infinity. 



Fig. 49.4 The hedgehog solution. 

y 

Recall what happens in such a the¬ 
ory. Owing to the positive mass term, 
the symmetry is spontaneously broken 
and the Higgs field <j> takes on a con¬ 
stant value in the ground state |4>o| = 

(1C\") ^ ' The broken symmetry ground 
states form a spherical shell S 2 in 4>- 
space of radius |4>o|. Previously we 
have chosen to direct the 4»-field along 
the 4>3-direction. In that case Wj* 
is equivalent to the usual electromag¬ 
netic field Av and we recover Maxwell’s 
electromagnetism. We therefore take 
the charge of the theory to be q : the 
electromagnetic charge. (Although of 
course we know that our electromag¬ 
netism resulted from symmetry break¬ 
ing in a SU (2) x U (1) gauge theory, the 
SO( 3) theory will be sufficient for ex¬ 
amining monopoles solutions.) 


^Recall that the vortex is one such ob¬ 
ject. 

^See Section 29.3. 


The solution with this topological property in (3+l)-dimensional space- 
time is the monopole, which has the form as \r\ —> oo that 

= (k|->oo), (49.12) 

with A a constant. The thing to notice here (as with the vortex) is 
the distinction between internal and real space components. The vector 
$ lives in internal isospace but the radial vector r lives in real space. 
However, eqn 49.12 links these two together. In components we have, 
for example <+|r| —> oo) = A-^ , that is, along the spatial 1-direction 
(also known as the ^-direction) the field points along the internal 1- 
direction. Similarly along the y-direction the field points along the 
internal 2-direction and so on. In this sense, the field points radially 
outwards at infinity, which is why Polyakov called this the hedgehog 
solution. This is represented in Fig. 49.4. 

Our task is now to find a theory for which the hedgehog is a solu¬ 
tion and then to show that it has the electromagnetic properties of a 
monopole. Let’s consider a gauge theory with local SO{ 3) symmetry, 
defined by a Lagrangian 

1 m 9 1 

C = -(D»*) ■ (£>„*) + — * $ - A(* • $) 2 - -G^ ■ G^, (49.13) 

where G^ is the gauge field tensor and _D^<f> is the covariant derivative 
given, respectively, by 

= d^Wv-dvW^ + qW^xWv, 

D^ = d^-q&xW^ (49.14) 

with q the SO(3) charge of the theory (which will be shown to be equiv¬ 
alent to electric charge a little later). Notice that the Lagrangian we 
have written is unstable to spontaneous symmetry breaking so $ is a 
Higgs field. 7 As stated above, we seek a topological field configuration 
for this theory with a different ground state at each point on the spatial 
boundary of the theory, which is the spherical shell S 2 at |r| —» oo. The 
resulting hedgehog solution of the Lagrangian in eqn 49.14 is given by 

$ =(4 x) 5 R (M-oo). (49.15) 

As with all time-independent soliton-like solutions in more than one 
spatial dimension, 8 Derrick’s theorem 9 tells us that this object, taken 
alone, gives rise to an infinite energy. We therefore need to appeal to 
the gauge field to stabilize the monopole and guarantee that its energy 
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remains finite. For this to be true we require the gauge field for 
|r| —> oo to be (see Exercise 49.1) 

w; = -£ iab ^, 

qr z 

W 0 a = 0, (49.16) 

where here a is the internal index and i is the real-space vector index. 


Example 49.3 

We have identified a topological object that can exist in (3+l)-dimensional spacetime, 
but we’ve yet to find its electromagnetic properties. This turns out to be a rather 
non-trivial matter since we have three internal components of the field strength tensor 
G/_iu and, therefore, three different choices of magnetic field 10 B ax = — (with 

no sum implied over repeated indices). For the more usual uniform broken symmetry 
ground state, where points along the 3-direction, the choice is simple, the a = 3 
component of G^ u reduces to the usual form F^ u . More generally for an arbitrarily 
oriented ground state (i.e. one not necessarily pointing along the 3-direction) we have 

* -G>. (49.17) 


For any situation other than the ground state of 4* we need a more complicated 
expression containing terms in _D^4> which vanish in the ground state. 11 Fortunately, 
these complexities may be ignored since we only require the magnetic flux emerging 
from the hedgehog solution, for which we find 

k 

Fij = F*j = (49.18) 

gjr\ J 


Finally we’re ready to see what the hedgehog field predicts for the magnetic field of 
a monopole: it is 

B= 1 - 7 -, (49.19) 

q |r| J 

which is exactly the radial field expected for a monopole. It results in a magnetic 
flux 

47T 

$m = — • (49.20) 

9 


The hedgehog solution is therefore confirmed as a magnetic monopole. Comparing 
the flux from the hedgehog with the flux expected from a magnetic monopole 4> = g, 
we must have that = g or, on restoring factors of h , 


9 = 


2 h 
9 ' 


(49.21) 


Comparing this with the result for a single Dirac monopole, for which g = h/q , we see 
that the magnetic charge of the’t Hooft-Polyakov monopole is twice the magnitude 
of the Dirac monopole. 


Notice how the ’t Hooft-Polyakov monopole and Dirac monopole are 
very different beasts. Dirac monopoles are singular mathematical so¬ 
lutions of electrodynamics which necessitate the introduction of point 
particles as the sources of the magnetic flux. These particles have arbi¬ 
trary spin and mass. In contrast ’t Hooft-Polyakov monopoles are non¬ 
singular solutions arising from the interaction of a non-abelian gauge 
theory and a scalar field. All of their properties, such as their mass, are 
determined by the original theory. 12 


10 Here, as elsewhere in this chap¬ 
ter, a = 1,2,3 is the internal index 
and i,j,k are vector indices. Com¬ 
pare these three choices with the sin¬ 
gle choice from conventional electro¬ 
magnetism: B l = -e ijk F jk (no sum 
implied over repeated indices). 


11 See Exercise 49.3. 


12 

The fact that these monopoles are so¬ 
lutions to the sort of broken symme¬ 
try theory that describes our Universe 
leads us to question why we have not 
yet observed any of them, ’t Hooft 
estimated the mass of these beasts as 
Mw /ol ~ 137Mw ~ UTeV, making 
them very heavy indeed and outside the 
range of our experiments. Their exis¬ 
tence therefore remains an open ques¬ 
tion. 
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Chapter summary 

• Dirac’s magnetic monopole leads to a singularity in the magnetic 
vector potential which cannot be removed (the so-called Dirac 
string). The quantization of magnetic charge of this monopole 
is inextricably linked to the quantization of electric charge. 

• The’t Hooft-Polyakov monopole is a topological object in a non- 
abelian gauge theory. 


Exercises 


(49.1) Show that in order to kill the divergent derivative 

of the monopole Held «f> = (ix) ” yj at r ->oo we 
must introduce a covariant derivative with a gauge 
field with components VI),. 


Wo = 0. (49.22) 

(49.2) Verify eqn 49.19. 

(49.3) We can generalize the definition of the Maxwell 
electromagnetic field F fIU so that it describes elec¬ 
tromagnetism in the presence of the topologi¬ 
cal monopole and reduces to ordinary electromag¬ 
netism under the normal circumstance of a broken 
symmetry Universe with components <f> 3 ^ 0 and 


Vb 1,2 ' = 0. We define the modified electromagnetic 
field tensor as 

•TV" = — ^ 1^,3 • (D m $ x £>„<!?). (49.23) 

(a) Check that this reduces to the ordinary form of 

as claimed. 

We may also define A M ' kV)., which, in 

a broken symmetry Universe without monopoles, 
picks out the part of the potential that gives ordi¬ 
nary electromagnetism. 

(b) Show that this leads to a cleaned up field equa¬ 
tion 

= d^Av-dvA^- ^ -(c^tfrxd^). (49.24) 

(c) Starting with eqn 49.24 verify the algebra that 
leads to the value of the B-field in eqn 49.19. 




Instantons, tunnelling and 
the end of the world 



There is a theory which states that if ever anybody discov¬ 
ers exactly what the Universe is for and why it is here, it 
will instantly disappear and be replaced by something even 
more bizarre and inexplicable. There is another theory which 
states that this has already happened. 

Douglas Adams (1952-2001) 


In the true vacuum, the constants of nature, the masses and 
couplings of the elementary particles, are all different from 
what they were in the false vacuum, and thus the observer is 
no longer capable of functioning biologically, or even chemi¬ 
cally. 

Sidney Coleman (1937-2007) 
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One aspect of conventional quantum mechanics that we have not yet 
addressed with quantum field theory is tunnelling. In this chapter we 
will discuss a class of objects known as instantons which are found 
in the path integral version of quantum mechanics. Their field theory 
analogues allow us to address the problem of tunnelling in quantum field 
theory. In particular we will address the apocalyptic question of the end 
of the world! We know that when a system breaks a symmetry the 
result is that the vacuum of a system is one of (potentially) a number 
of equivalent vacua. This is presumably the case in our own Universe. 
But what would happen if the vacuum that the system adopts is not 
actually the lowest energy state? What if there is one with a slightly 
lower energy? If this is the case with our own Universe then we might 
worry that the Universe will undergo a transition, via a tunnelling event, 
into the true vacuum with catastrophic consequences for those of us who 
have grown dependent on the physics of the current (false) vacuum. 1 

We start by examining instantons. These are very similar to the kinks 
we examined in Chapter 29 which exist as stable entities localized in 
space. Instantons are constant energy solutions to quantum mechanical 
equations of motion with a characteristic, stable structure localized in 
time. The name ‘instanton’, coined by ’t Hooft, reflects this particle¬ 
like existence in time, rather than space. Instantons are associated with 
critical points (local maxima, minima or saddle points) of the action. 


1 This problem was called the ‘The fate 
of the false vacuum’ by Sidney Cole¬ 
man and our approach to this problem 
follows his treatment closely. See the 
superb lecture in his collection Aspects 
of Symmetry for the full story. 
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2 In Chapter 25 we used the Wick ro¬ 
tation, which turns Minkowski space- 
time into Euclidean spacetime via the 
rotation taking x° —> — ir, purely as a 
means of relating quantum field theory 
to statistical physics. In this chapter we 
will see that working in Euclidean space 
has the advantage of leading to new 
and useful solutions to the equations 
of motion of quantum particles. Note 
that here we do not impose the peri¬ 
odic boundary conditions of statistical 
physics and so imaginary time stretches 
from — oo < r < oo. We will also call 
the time interval over which the calcu¬ 
lations are carried out T in Minkowski 
space and Te in Euclidean space. 


^ See Chapter 23. 


50.1 Instantons in quantum particle 
mechanics 


To reveal the physics of the instanton we will temporarily leave behind 
fields and deal with single-particle quantum mechanics, albeit described 
by the path integral and Green’s functions. We will examine a single 
particle in one spatial dimension described by a Lagrangian L = p 2 /2m— 
V(x). This gives rise to an action S = f^f^dtL and a path integral 
G = J V[x(t)\ e lS / h which gives us the Green’s function, or amplitude, for 
any given scenario. (For simplicity we w r ill henceforth assume the particle 
described here has unit mass.) We now make a switch to Euclidean 
space, 2 where we have a Euclidean action 


•S'e 


J-Te/2 2 \d T / 


V(x) , 


(50.1) 


and a path integral 

G(x f , T e / 2, a*, —T e /2) = J V[x{t)] e~ s ^ h , (50.2) 


where G is the Green’s function describing the amplitude for starting at 
(—Te/ 2, Xi) and ending up at (Te/2, Xf), and the integral is carried out 
over all trajectories that have this property. 

Let’s suppose a stationary trajectory exists called x(t). Applying the 
Euler Lagrange equations to the Euclidean action leads to 


d 2 x dV(x) 
dr 2 da; 


(50.3) 


which differs from the usual equation of motion for a particle in a poten¬ 
tial by a minus sign. Doing our analysis in Euclidean space has resulted 
in equations of motion that seem to describe the motion of a particle in a 
potential —V(x). As a result, the constant of the motion corresponding 
to energy is given by 




V(x). 


(50.4) 


The solutions to the equations of motion that minimize the action have 
constant energy and so this equation for E enables us to identify them 
very conveniently. 


Example 50.1 

In order to find the Green’s functions which describe amplitudes in this potential we 
will use what is known as the stationary phase approximation whose guts are 
described here. Since we know that the path integral will be dominated by those 
paths closest to the stationary trajectory 3 , we split the integral into 

G = e —(Stationary action)/fi x ( quantum corrections). (50.5) 
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The key to solving problems is then to find the stationary action for the situation 
we are trying to describe and expanding around this point. The implementation of 
this approximation becomes clear if we look at a simple integral X = f dx . 

Expanding f(x) about a stationary point at x we have 


f(x) = f(x) + i/"(x)(x - x) 2 + ... 

The second term will present us with a Gaussian integral and so we obtain 

i 

2tt 


p-ZO) 


/"(*) 


(50.6) 


(50.7) 


This result carries over to the case of the functional integral and we obtain an ex¬ 
pression for the Green’s function, for our Euclidean theory of a unit mass particle, 
given by the stationary phase approximation as 


G(xf , Te/2, X {, —T e / 2 ) 


JY e -S[x]/h 


det 




1 

2 


(50.8) 


which is accurate to order O(h), which will be sufficient for our purposes, and where 
N supplies the normalization. 


The conclusion from all of this scene-setting is simply that using the 
Euclidean action has the effect of flipping the sign of the potential. The 
point of this is that there are stationary solutions which exist in the 
Euclidean world of upside-down potentials which may be analytically 
continued back to Minkowski space. We therefore potentially gain access 
to lots of new possibilities for the motion of particles to which we were 
previously ignorant. One of these is tunnelling, which would never be 
predicted by calculating the quantum corrections to a stationary path 
found in Minkowski space. 

In the next sections we will build some confidence in this approach by 
examining the stationary solutions that follow from some simple upside- 
down potentials. 


V 



Fig. 50.1 A potential well. 

50.2 A particle in a potential well 


Example 50.2 


As a warm up exercise in getting used to the formalism, we will examine the case of a 
particle in a potential well such as that shown in Fig. 50.1, which has V(x = 0) = 0. 

In the Euclidean version of the story the well is turned upside down into the 
potential hill shown in Fig. 50.2. If we impose boundary conditions that we want 
Xi = Xf = 0 then clearly the only solution is for a particle to balance at the top of the 
hill. This involves the particle stopped at x = 0 for the entire interval Te and so has 
action S(x) = 0. Using our equation for the Green’s function we find the quantum 
amplitude that the particle starts at x = 0 at time r = —Te/2 and is found there 
again at r = Te/2 is given by 


<3(0, T e /2, 0, -T e /2) = N 


det 



1 

2 


(50.9) 


where a; 2 = V^O). Recall that we’ve previously solved a related (exactly solvable) 
path integral problem: that of the simple harmonic oscillator potential. In that case 
we concluded that (in Minkowski spacetime) we have 


G(0, T/2,0, —T/2) 


(0|e _iwT/ft |0) = N 


det 



1 

2 


(50.10) 
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Fig. 50.2 An upside-down potential 
well, inverted by the Wick rotation to 
Euclidean space. 
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V 



Fig. 50.3 The double potential well. 


where ujq = V’ /, (0) and V(x) = ^lJqX 2 . The stationary phase version above is simply 
an approximate generalization of this result with a sign change due to our working 
in Euclidean space. 

Returning to the more general potential and to Euclidean space we see that work¬ 
ing out G is made difficult by the determinant. However, we can argue the general 
form of the answer. We know that in Euclidean space the Green’s function will look 
like G = (0|e -ifTE /^|0) = e ~ E ° TE / h , and we expect that the ground state energy 
should be something close to Eq ~ ^frw for small oscillations in a potential well, 
suggesting that G oc In fact, for large Te, it may be shown that 

d 2 


G(0,T e /2,0,-T e /2) = N 


det — 


dr 2 


\ 2 ( 
7T H' 




(50.11) 


and we read off the ground state energy as Eq = [1 + O(h)], as we expect. 


We’ll follow the same procedure again for more interesting potentials. 
The procedure is to write down the potential and shift it to Euclidean 
space, find the stationary configuration, work out its action and then 
find the approximate Green’s function using eqn 50.8. 



Fig. 50.4 The double potential well, 
turned upside-down by the Euclidean 
rotation. 


—V 



50.3 A particle in a double well 

The instanton solution is found when we repeat the procedure outlined 
above for the double potential well. We define the well with minima 
at ±a and take V(—x) = V(x). We also define V"{±a) = to 2 . The 
potential for the double well is that shown in Fig. 50.3 and shifting 
to Euclidean space turns it upside down, as shown in Fig. 50.4. We 
look for solutions to the equations of motion with constant energy with 
boundary conditions that the particle itself is stationary at r = ±Te/2. 
These are useful since we will eventually make Te very large and so 
they will represent well defined solutions of finite energy. There are 
two obvious sets of possible solutions: one involves a particle fixed at a 
or at —a, just as we had in the previous section. However, there is a 
more interesting possibility: the particle starts at —a at — Te/2 and ends 
up at a at Te/2 as shown in Fig. 50.5. This is the instanton solution. 
Viewed with the potential flipped the right way up, the end points of 
this solution correspond to a particle tunnelling though the barrier. 


Example 50.3 

We may calculate the properties of the instanton. With the potential as defined in 
Fig. 50.5 this solution has E = 0, so we have 


^ = (2F)i 

dr 


The action of an instanton is given by 


So = 


h H 


■ V 


/*(!) -/> (2V)i 


At large Te we have that x = uj{a — x) leading to 

(a — x) oc e~ UJT , 

which tells us that the (temporal) size of an instanton is 1/a;. 


(50.12) 

(50.13) 

(50.14) 


Fig. 50.5 The instanton solution. 
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The previous example demonstrates that the instanton has a well defined 
structure in time of size r ~ 1 /u>. It is shown in Fig. 50.6, where its 
temporal extent is given by the width of the region where it crosses the 
axis. Notice the similarity between this and the kink in Chapter 29. 
(The kink represented a well defined structure in space, with a spatial 
extent l ~ 1/m.) 

In order to use the instanton to predict the properties of the double 
well we must allow the possibility of the existence of more than one tun¬ 
nelling event. This corresponds to examining more than one instanton. 
This is the subject of the next example. 





Example 50.4 

We will now examine the case of a dilute gas of instanton objects. Using the same 
double well, the ‘gas’ corresponds to a particle on top of one hill that occasionally 
rolls down the hill and up to its neighbour, and then some time later rolls back down 
the hill and up to where it started. Some time later it undergoes another rolling 
event, and so on. The gas then comprises n rolls, constrained so that an instanton 
a —► —a must be followed by an anti-instanton —a —► a. (Recall that we had the 
same constraints on kinks and antikinks.) The gas is dilute in that each instanton 
has a size (in time) far smaller than the gaps between instantons and anti-instantons. 
The gas is shown in Fig. 50.7. This may the strangest picture of a gas it’s possible 
to draw! 

The action of n dilute instantons is given by nSo , where So is the action of one 
instanton that we calculated above. For a single well we had G(0, Te/2, 0, — Te/2) = 
(lu/ttK) 2 e _a;T E/2 > p or the gas in the double well we need to consider this amplitude 
multiplied by the contribution K from each of the n instantons: 

( — T e-“ T E/ 2 /y", (50.15) 

V irfi' 

where K is the amplitude for the occurrence of a single instanton. Since we don’t 
know where each instanton is centred in time, we must integrate this quantity over 
the centres of all of the instantons. This is conveniently carried out by noting that 


r T e / 2 rri 

r T n—i T£ 


/ dri / dr 2 ... 

/ dr " = — f- 

(50.16) 

'-Te/2 J—Te/2 

J-Te/2 n! 



Putting all of this together we have, for example, to order 0(h ), an amplitude for 
the particle to start and finish at x = — a of 


G(—a, Te/ 2, —a, —Te/2) 


f ^ \ 2 -uT b /2 ( Ke s °/ h T E ) 

\ivhy n e \ 


(50.17) 


where n e means that n only includes even numbers, ensuring that, however large we 
make n, the particle ends up where it started. 4 These sorts of sums may be done and 
the general result is that 


G 




1 

2 



exp (Ke ~^Te) =F exp (—Ke "J?Te) 


(50.18) 


We may read off the physics by comparing with e~ ET ^/ h as we did for the single 
well. In this case we find that we have two low-lying energy states with energies 
corresponding to whether we finished at —a (summing over even n) or +a (summing 
over odd n). The energies of the two states are given by 


E± 


-hw±hKe~ s ° /h . 

2 


(50.19) 


It’s worth pausing at this stage to take stock of what we’ve done by returning to 
Minkowski space and flipping the potential back the right way up. 


Fig. 50.6 The instanton as a tightly 
bound structure in (imaginary) time. 


X 



Fig. 50.7 A gas of instantons for the 
double well problem. 


Note that h has been reintroduced for 
clarity. 


4 The same argument says that amnesia 
may be avoided if one is hit over the 
head an even number of times. 
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Fig. 50.8 A potential with a barrier. 


-V 



Fig. 50.9 The barrier potential turned 
upside-down. 


- 1 / 



Fig. 50.10 The bounce solution: an¬ 
other example of an instanton. 


X 



The conventional quantum mechanics of the double well is instructive here. We 
have solutions that are localized in the left-hand well | — a) and in the right-hand well 
|a). We’ll call the energy for sitting in a potential minimum Uo, but we allow the 
Hamiltonian to permit barrier penetrating transitions between minima with matrix 
element (— a\H\a) = (cl\H\ — a) = —V. The solutions are 


75 (l — a ) + l a )) E = Uo — V, 
75(1 ~ |o>) E = U 0 + V. 


(50.20) 


The system saves some energy through the wave function spreading out over the two 
wells. The symmetric combination lies lowest, separated by an energy 2V from the 
antisymmetric combination. 

In the present case, we can identify Uo = ^ fko as the energy for sitting in 
the bottom of a well and V = HKe~ s o/ h . This makes sense as the difference in 
energies is proportional to the factor accounting for barrier penetration e~ s °/ h . 


Example 50.5 

We now examine tunnelling to freedom through a barrier as another example of the 
use of instantons. We start with the potential shown in Fig. 50.8 and up-end it by 
working in Euclidean space to obtain Fig. 50.9. The stationary solution of interest is a 
different instanton to those considered thus far. This instanton, shown in Fig. 50.10, 
involves the particle starting at x\ = 0 , travelling to x = a, where it bounces, reverses 
its direction and returns to its starting point at Xf = 0 . 

This is rather similar to the previous problem of the instanton gas, except that 
the Green’s function is given by summing over all n, since any number of instantons 
result in the particle back where it started at x = 0. We may carry out the sum to 
give 

G(0, T e /2, 0, -T e /2) = ) 5 e““ TE/2 exp [ftre“ So / fi T E ] , (50.21) 

which we might think tells us of an energy eigenvalue of Eq = + hKe ~ s °/ h . 

However, the interpretation of this solution is slightly more subtle. 

Let’s return to the right-way-up potential of Fig. 50.8. We’ve seen that the factor 
hKe~ s °/ h tells us about barrier penetration, but penetrating through the barrier in 
this problem results in the particle rolling away to infinity. This suggests that the 
bound state in the potential is unstable. Additional evidence arises when we plot the 
trajectory of the bouncing particle, which is shown in Fig. 50.11. The fact that the 
trajectory has a maximum tells us that the wave function must have a node. This 
means it’s not the lowest energy wave function and there must be a lower one with a 
negative energy. Since if is a function of the square-root of energy then this implies 
that K is imaginary. This fits with the idea of barrier penetration: we know that 
unstable states have energies with imaginary parts which, when plugged into e~ lEt 
give e — iE 0 t—rt w j iere is the decay rate of the state. What our instanton tells us is 
not a correction to the energy of a bound state, but rather the imaginary part of the 
energy, which gives is the decay rate. We conclude therefore that ImEo ~ h\K\e~ s °/ h 
and so the decay rate of the state is roughly given by 

r ss (50.22) 


Fig. 50.11 The trajectory of the parti¬ 
cle during the bounce has a maximum, 
signalling a node in the wave function. 



50.4 The fate of the false vacuum 463 


50.4 The fate of the false vacuum 

Finally we turn to fields. As stated in the introduction, our goal is to 
describe what happens to a broken symmetry system in the case that 
the vacuum that the system has chosen (known as the false vacuum) lies 
slightly higher in energy than another minimum in the potential, which 
we call the true vacuum. A potential describing this state of affairs is 
shown in Fig. 50.12. To be more colourful, we imagine that the vacuum 
of the Universe in which we live is a false vacuum (at <f> = <f> + ) and ask 
whether the Universe is about to tunnel through a potential maximum 
into the real vacuum (at </> = </>_). 

By analogy with the case of the magnet in Chapter 26, the catas¬ 
trophic quantum collapse of the false vacuum would seem rather un¬ 
likely. In the magnet example the process of tunnelling to the true 
vacuum would rely on each electronic spin in the system simultaneously 
tunnelling. We might have expected this to have a vanishingly small 
probability since there are so many particles. However, the possibility 
of instanton solutions in the field allow a scenario where a cluster of spins 
finds the true vacuum and then an instanton causes the spins forming 
the boundary of this ‘bubble’ to flip, causing the Universe to pass a 
tipping-point beyond which there is sufficient volume of true vacuum 
to make this the preferred phase. Whether a change in phase of the 
Universe actually occurs depends on the relative balance of the energy 
saving in realizing the true vacuum versus the cost of the domain wall 
between true and false vacua. 


Example 50.6 

Supercooled liquid presents us with a similar problem. In that case we cool a liquid 
which reaches a metastable state of liquidity, separated by a small energy from the 
true vacuum of solidity. A similar situation occurs for the superheated liquid, which 
remains in a metastable state of liquidity rather than evaporating. 5 

For the superheated liquid, thermal fluctuations often result in a small bubble of 
gas appearing. The inside of the bubble is the true ground state and so results in 
an energy saving e, per unit volume. On the other hand, the surface energy of the 
bubble costs energy a per unit surface area. We might write the energy 


E = — 


4 

3 


tv r 3 e + 47rr 2 cr, 


(50.23) 


shown in Fig. 50.13. Small bubbles, with a small volume energy saving compared 
to surface area cost will disappear. At some point a bubble will be large enough 
that it is energetically favourable for it to increase in size. By finding the stationary 
point in E we can work out how large a bubble must be to become ever-expanding. 
Extremizing, we find a maximum in E at 


r = —. (50.24) 

e 

Bubbles larger than this find it energetically favourable to expand. As this occurs, 
more and more space is taken up by the true vacuum, which eventually spreads 
throughout all space. 


U 
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Fig. 50.12 False vacuum potential. 


^ Although in this case, you should note 
that there’s no difference in symmetry 
between a liquid and a gas, so the anal¬ 
ogy with symmetry breaking no longer 
carries. 

E 


A 



Fig. 50.13 The energy of a bubble of 
gas of radius r in a superheated liquid. 
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6 again setting h = 1. 


We examine the decay of the false vacuum by analogy with the particle 
problems looked at so far in this chapter. The decay process is described 
by a bouncing instanton, which gives a decay rate. Since in quantum field 
theory we deal with a path integral 6 f V<p e _ -T d x £b , then the quantity 
we will find is T/V, a decay rate per unit volume, which we expect to 
be given by 

T/V = (50.25) 

where So is the action of the instanton bounce. Calculating the approxi¬ 
mate action of the bounce will be possible: since the minima of the false 
vacuum potential are only separated by a small energy difference, they 
closely resemble the exact double well problem for which we know the 
action. 

Let’s now be specific. We consider a quantum field theory in four¬ 
dimensional Euclidean space. This has an action 


S = 


d 4 x 




U(<t >) 


(50.26) 
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Fig. 50.14 False vacuum potential in 
Euclidean space. 
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Fig. 50.15 The bubble in Euclidean 
space. 


We cook up a symmetric, double-well potential f7 sym , which we then 
modify slightly to obtain the false vacuum potential U: 

U = U sym + e{(j) - a)/2a. (50.27) 

The potential has minima at <j>± = .4 a. although that at (j>+ = a lies at 
a slightly higher energy, by an amount e (Fig. 50.12). 

We will follow the procedure we developed for identifying instantons 
in particle mechanics. We may employ the results from the particle, as 
long as we remember that the imaginary time r in the Euclidean particle 
problem has become a spacetime point r in four-dimensional Euclidean 
space. The Euclidean potential is upside-down as shown in Fig. 50.14. 
The bouncing instanton clearly starts on the hill of the false vacuum 
and bounces from the point <f> c , where U(<f>) cuts the 0-axis. 

We can also imagine this running backwards, with the particle starting 
at a point </> c near the true vacuum at 0_. We choose to look at it this 
way around because the bubble that will result in 4-space then resembles 
the bubble in the superheated liquid with true vacuum within and false 
vacuum without (shown in Fig. 50.15). The particle will roll down the 
hill and come to a rest exactly on top of the hill at 0 + (the false vacuum). 
We’ll imagine this takes place very rapidly around some point in time r 
that we’ll call r = R. Back in field language we ask what the hill-roll 
looks like. In Euclidean space we have a large spherical bubble of radius 
r = R. This has a thin wall representing the rolling process. (The 
wall is thin because the roll takes place very quickly.) The bubble wall 
separates the false vacuum on the outside of the bubble from the true 
vacuum on the inside. The wall of the bubble contains the instanton. If 
R, the spacetime point where the instanton is centred, is greater than 
the critical radius R c above which the bubble is self-sustaining, then the 
instanton creates a bubble that will drag all of spacetime into the true 


vacuum. 
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We will now construct an action S for the rolling process. As r in¬ 
creases we start inside the true vacuum bubble, move through the wall 
and end up outside. The action is given for this spherically symmetric 
problem by 

r°° 1 1 

So = 2ir 2 / drr 3 

Jo 




2 V dr 


(50.28) 


Unlike the previous examples, where only the instanton part contributed 
to the action, we need to be more careful here. Inside the bubble we are 
in the true vacuum where we have <j>- and U = —e, so only the second 
term in the integral contributes, yielding an action 

Strue vacuum = -^n 2 R A £. (50.29) 

Over the thin surface of the bubble we have a contribution from the 
instanton part: 


-^instanton 


2t t 2 R 3 


dr 
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2 
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sym 


(50.30) 


where, as advertised, we choose to neglect the difference e in energy 
between the minima and so only need consider the symmetric double 
well potential U sym . We know that the instanton solution takes us from 
—a to a as r increases through R. By analogy with the particle problem, 
such an instanton has an action Si = /“ a d(j> (2£/ sym ) 5 . This gives us a 
contribution to the action from the instanton of 


S ins ta„to„ = 2 7 r 2 i? 3 S 1 . (50.31) 

Outside the bubble we are in the false vacuum where <f> = <f>+ and U = 0, 
which makes no contribution to the action. The total action is therefore 
given by 

S tot = - \k 2 R*£ + 2tt 2 R 3 S 1 . (50.32) 

We may now extremize the action to find the critical radius R c at which 
the tunnelling destroys the false vacuum. We vary the total action with 
respect to R and find a radius R c = 3Si/e. Substituting this back into 
the expression for the total action of the bubble we predict a rate of 
tunnelling to annihilation of 


t 

i 



Fig. 50.16 Spacetime diagram show¬ 
ing the growth of the bubble. An ob¬ 
server at O only knows of the bubble 
when its forward light-cone intersects 
the observer’s world-line. A time R c /c 
seconds later the bubble wall intersects 
the observer’s world-line. 


T/V « | K\e~ Stot , 


(50.33) 


where Stot = 27n 2 Sf/2e 3 . 

Now we swap back into the Minkowski space of our Universe, and ask 
how the Universe w T ill end. An analytical continuation of these results 
back to Minkowski space shows that the shape of the bubble in Euclidean 
4-space is the same as that of the bubble in (3+1) dimensions. As R 
increases, the bubble expands, sweeping out a region in spacetime given 
by the hyperboloid 


l*| 2 - ( ct ) 2 = Rl 


(50.34) 
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of the: 
V/orJ> 

\s 



We might expect that since the initial bubble is created by quantum 
mechanical fluctuations then R c should be a very small number, perhaps 
of the order of 1 frn. This would mean, from eqn 50.34, that \x\ 2 ss 
( ct ) 2 and the bubble expands at almost the speed of light. This, rather 
inconveniently, gives an observer effectively no warning of the coming of 
the true vacuum. Looked at on the spacetime diagram in Fig. 50.16 we 
see that the observer O doesn’t know about the existence of the bubble 
until the light from the forward light-cone of the bubble’s outer wall 
meets her world-line (see Fig. 50.17). We see that the true vacuum itself 
collides with her world-line a time R/c seconds later, and she ceases 
to exist. This time (by assumption, ss 1 frn/c seconds) is orders of 
magnitude shorter than the time it takes a neuron to fire. In Sidney 
Coleman’s words she ‘has literally nothing to worry about’. 


Fig. 50.17 We have essentially no 
warning of the coming of the true vac¬ 
uum. However, we can give some ad¬ 
vance notice of the imminent end of this 
book. 


Chapter summary 

• Instantons are kink-like solutions of the equation of motion asso¬ 
ciated with critical points in the action and allow tunnelling prob¬ 
lems to be treated. 

• Instantons can be used to study the fate of the false vacuum in 
a symmetry broken system, such as the Universe, where the true 
vacuum state is only a tunnelling event away. 


Exercises 


(50.1) Verify the expression for the stationary phase ap- (50.2) Verify the solution of the double well problem in 
proximation in eqn 50.8. eqns 50.20. 




Further reading 



Many of the ideas in the preceding essay have been taken 
from another... 

John Berger (1926- ) 


There are many other excellent books which introduce and describe 
quantum field theory. 1 Whether you are after the friendly approachabil- 
ity of Mattuck or Zee, or perhaps the elegant but unforgiving grandeur 
of Weinberg, or something in between, there is something for everyone. 
We particularly like Peskin and Schroeder, and also Ryder, and either 
(or both) could serve as a useful complement to this book. Many of the 
arguments we have used are adapted from these books (Zee and Peskin 
and Schroeder deserve special mention here) and also from the legendary 
lecture course given by Sidney Coleman. 2 Finally, for those eager to prac¬ 
tise, a useful set of solved problems may be found in Radanovic. 


1 See the detailed list below, beginning 
on page 470. 


o 

These mid-1970s lectures are, at least 
at the time of writing, viewable on the 
Harvard Physics website. 


Further reading by chapter: 


Chapter 1: An accessible introduction to Lagrangian mechanics is given 
in Feynman, Leighton and Sands. See Landau and Lifshitz (vol. I) for the 
full story. Chapters 2 and 3: The simple harmonic oscillator and occupa¬ 
tion numbers are explained in most books on quantum field theory. The 
books by Aitcliison and Hey, 3 P. Coleman and Feynman are especially 
clear. Chapter 4: Several examples of non-relativistic applications of sec¬ 
ond quantization may be found in P. Coleman. Chapter 5: Classical held 
theory is covered in Maggiore, Ryder and Itzykson and Zuber amongst 
many others. Chapter 6: The Klein-Gordon equation and the problems 
with its interpretation are explored in Aitchison and Hey. Chapter 7: 
The Lagrangians we describe are all examined in most books on quantum 
held theory. Ryder, Huang, and Zee are good sources of information. 
Chapter 8: Time evolution operators and related issues are discussed in 
most books on advanced quantum mechanics (Ziman, for example) and 
in the lectures by S. Coleman. Chapter 9: Transformations are described 
very clearly in Ryder. For a more sophisticated treatment see Weinberg. 
Chapter 10: An in-depth discussion of Noether’s theorem may be found 
in Neuensclrwander. Both Ryder and Huang provide accessible introduc¬ 
tions. Chapter 11: Our treatment of canonical quantization follows S. 
Coleman’s lectures. Aitchison and Hey, Bjorken and Drell 4 (RQF) and 
Schiff are other sources of clear information. Chapter 12: The properties 
of the complex scalar held are examined in nearly all books on quan- 


3 Assume volume I of multivolume texts 
unless stated otherwise. 


1 We abbreviate Bjorken and Drell’s 
Relativistic Quantum Fields as here 
RQF to avoid confusion with their Rel¬ 
ativistic Quantum Mechanics (RQM). 




468 Further reading 


turn field theory [see Aitchison and Hey or Bjorken and Drell (RQF)]. 
Our discussion of its non-relativistic properties follows Zee. Chapter 13: 
Internal symmetries and vector fields are discussed in Ryder, Zee and 
in Aitchison and Hey. Chapter 14: Gauge theory and its consequences 
are very clearly described in Aitchison and Hey who also give a good 
exposition of the difficulties inherent in the canonical quantization of the 
electromagnetic field. The solution to these difficulties is discussed in 
Peskin and Schroeder. Chapter 15: Charge conjugation, parity and time 
inversion are described in Aitchison and Hey and in Bjorken and Drell 
(RQF). A more sophisticated account is Weinberg. Relevant background 
on the mathematics may be found in the books by Georgi and Nakahara. 
Chapter 16: Green’s functions are introduced in Mattuck in a similar 
way to that described here. An approach to perturbation theory based 
on propagators may be found in Scliiff. Classical Green’s functions are 
described in Barton. Chapter 17: The Feynman propagator is discussed 
in Aitchison and Hey. The non-relativistic version is discussed in Mat¬ 
tuck. Chapter 18: Useful information on the S'-matrix may be found in 
Aitchison and Hey, and also Peskin and Schroeder. Weinberg provides 
the philosophy. Chapter 19: The expansion of the S'-matrix is carried 
out for a simple toy theory in Aitchison and Hey, while the <)> 4 theory is 
discussed in Peskin and Schroeder and in Ryder. Chapter 20: Scatter¬ 
ing is covered by most books on quantum field theory and on particle 
physics. An introduction to scattering for particle physics applications 
is given in Griffiths. Clear treatments from the field theory perspective 
appear in Peskin and Schroeder (who discuss ip'ip (p theory) and also in 
Weinberg. Chapter 21: Statistical physics is described in Blundell and 
Blundell and at a more advanced level in Chaikin and Lubensky. Linear 
response theory is described in Binney, Newman, Dowrick and Fisher. 
The analogy between field theory and magnetism is described in more 
detail in Peskin and Schroeder. See also Zinn-Justin. Chapter 22: Gen¬ 
erating functionals are described in Ryder, Peskin and Schroeder, and 
in Binney, Newman Dowrick and Fisher. Chapter 23: The path integral 
is described in Zee and Ryder. A different approach is described by 
its inventor in Feynman and Hibbs. Chapter 24: Functional integrals 
are clearly described in Zee, Peskin and Schroeder and in S. Coleman. 
Chapter 25: The Wick rotation is discussed in Peskin and Schroeder 
and Altland and Simons. Kapusta and Gale is a good book on the 
applications of statistical field theory and Mahan describes how these 
techniques are used in solid state physics. Chapter 26: Broken symme¬ 
try is discussed in Blundell and in more detail in Anderson. A lively, 
advanced, field-centred treatment may be found in S. Coleman. Chap¬ 
ter 27: Coherent states are introduced in Annett and in Loudon. The 
history of the phase operator problem is described in the paper by Nieto, 
arXiv:hep-th/9304036vl. Altland and Simons and P. Coleman describe 
coherent states for field applications. Chapter 28: Grassmann variables 
are treated in Chapter 44 of Srednicki, and in great detail by Negele and 
Orland. Chapter 29: Zee is a very clear source on topological effects and 
we have followed his treatment in our presentation. Topological objects 
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are described in Zee and Ryder, with more advanced treatments in S. 
Coleman, Weinberg, Nakahara, and in Altland and Simons. Chapter 30: 
Our description of topological field theory follows Zee and the notes by 
Dunne (arXiv:hep-tli/9902115vl). It is examined in more advanced form 
in Wen. Chapter 31: Quasiparticles are described in Mattuck and P. 
Coleman. The Landau Fermi liquid is introduced in P. Coleman. Chap¬ 
ter 32: The philosophy of renormalization is made clear in Peskin and 
Schroeder and in Weinberg. Chapter 33: The use of renormalization in 
perturbation theory is stressed in Peskin and Schroeder. Chapter 34: 
Our presentation of the philosophy of the renormalization group is sim¬ 
ilar to that of Zee and of Peskin and Schroeder. Good sources on the 
use of the method may be found in Altland and Simons, Peskin and 
Schroeder, Binney, Newman, Dowrick and Fisher and McComb. Chap¬ 
ter 35: More detail on ferromagnetism and the renormalization group 
may be found in Altland and Simons. Chapter 36: There are many dif¬ 
ferent approaches to introducing the Dirac equation. Aitchison and Hey 
and Ryder are good places to start. A more modern approach is found 
in Maggiore. We have followed the approach in Penrose. Chapter 37: 
The transformation of spinors is discussed in Ryder and Maggiore and 
in more detail in Weinberg. Chapter 38: The quantum mechanics of 
fermion fields is introduced very clearly in Peskin and Schroeder and 
in Bjorken and Drell (RQM and RQF). Chapter 39: Simple examples 
from QED feature in Aitchison and Hey, Mandl and Shaw and Peskin 
and Schroeder. The history of the subject is explored in Schweber. 
Chapter 40: The examples of scattering in QED covered in this chapter 
are more fully explained in Peskin and Schroeder. See also Berestet- 
skii, Lifshitz and Pitaevskii for lots more examples. Chapter 41: The 
renormalization of QED is covered clearly and in detail in Peskin and 
Schroeder. Chapter 42: Bolgoliubov’s treatment is described in Zirnan. 
The Lagrangian treatment of a superfluid may be found in Zee and in 
more detail in Wen. See also Fulde. Chapter 43: An introduction to 
the field theory of metals may be found in P. Coleman. See the detailed 
tome by Mahan for the full story and also Giuliani and Vignale for more 
background. Chapter 44: Our treatment of superconductivity in terms 
of second quantized operators follows Annett. The Lagrangian formu¬ 
lation is described by Wen. See also Weinberg (vol. II) and Aitchison 
and Hey (vol. II). Chapter 45: The field theory of the fractional quan¬ 
tum Hall effect is described straightforwardly in Zee and in more detail 
in Ezawa and Wen. Chapters 46 and 47: Non-abelian gauge theory 
and the Weinberg-Salam model are described in Ryder, in Peskin and 
Schroeder and, of course, in Weinberg (vol. II). An approach motivated 
by particle physics phenomenology is presented in Aitchison and Hey 
(vol. II). Chapter 48: Majorana fermions are described in Aitchison and 
Hey (vol. II) and in a review by Pal (arXiv:1006.1718v2). Chapter 49: 
Magnetic monopoles are described in Ryder, Zee and by S. Coleman. 
Chapter 50: Our treatment of instantons is based very closely on the 
lecture in the collection by S. Coleman. Appendix B: Useful background 
may be found in Penrose. Our treatment follows that of Boas. 
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Useful complex analysis 



For every complex problem there is an answer that is clear, 
simple, and wrong. 

H. L. Mencken (1880-1956) 

Throughout the book we have tried to keep the amount of complex 
analysis to a minimum. This appendix provides a simple guide to some 
of the complex analysis commonly employed in quantum field theory. 
The guide is illustrated by examples drawn from the subject, including 
the most important function of a complex variable in quantum field 
theory: the propagator. 


B.l What is an analytic function? 

473 

B.2 What is a pole? 474 

B.3 How to find a residue 474 
B.4 Three rules of contour inte¬ 
grals 475 

B.5 What is a branch cut? 477 

B.6 The principal value of an in¬ 
tegral 478 


B.l What is an analytic function? 


If a function is analytic in a region close to a point z, then it has a 
derivative at every point in that region. We define the derivative of a 
complex number as 


f(z) 


d/ = lim A f(z + Az) - f(z) 
d z Az->o Az 


(B.l) 


Importantly, for the function to be analytic, the derivative shouldn’t 
depend on the way the interval in the complex plane A z is selected. 


Example B.l 

The function f(z) 


= z 2 is analytic; g(z) 


/'(*) 


lim 
Az—>0 


= \z\ 2 is not. To see the first write 
(z + Az) 2 - z 2 
Az 


lim 
Az—>0 


z 2 + 2z + (Az) 2 - z 2 
Az 


= 2z, 


(B.2) 


just as for a normal derivative. Note that we didn’t have a choice of Az, this procedure 
works whatever we choose. 

On the other hand for g(z) = \z\ 2 we have 


9 1 (*) 


lim 
Az—^0 


\z + Az\ 2 
Az 



(B.3) 


If Az = \Ay (with y real) then you will get a different derivative to the case of 
Az = Ax, with x real. See Boas for more details. 
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1 By singularity, we mean a point where 
a mathematical object is not defined. 


r + ■ • • (B-4) 


B.2 What is a pole? 

A pole is a type of singularity 1 that behaves like the singularity of 1/z n 
at z = 0. Let f(z) be analytic between two circles C\ and C%. In the 
region between them we can write f(z) as a so-called Laurent series 
expanded about a point zq: 

f(z) = ao+ai(z—Zo)+ci2{z—Zo) 2 + ..- -y + y-cy + • • • (B.4) 

(z-z 0 ) (. z-z 0 y 

The part with b coefficients is known as the principal part of the se¬ 
ries. Don’t confuse this with the principal value of an integral, which is 
different, and discussed below. 

A result and some definitions: 

• If all 6’s are zero then /(z 0 ) is analytic at z = Zq. 

• If all b's after b n are zero then we say that we have a pole of order 
n at z = zq. If n = 1 we have a simple pole. 

• The coefficient b\ is called the residue of f(z) at 2 = Zo ■ 

An important example is the function 

, a 


m = 


z-(3 ’ 


which has residue b\ = a and all other at and 6, zero. This function has 
a simple pole at z = f3. 


—Ep 

• 1 £ 

Ep 

— £ 

- • ^ 


Example B.2 

Let’s examine the pole structure of two of our propagators. The non-relativistic, 
retarded, free electron propagator is given by 

(R6) 

This has a first-order pole at E p — ie. This is shown in Fig. B.l(a). 

The Feynman propagator for the free scalar field is usually written 

A (p)-a-; (B.7) 

p z — m z + ie 

this is helpfully rewritten (see Chapter 17) as a function of the complex variable p°: 


A(P) 2 E p _(p°)-E p + ie (p<>) + E p - ie. ' (B ' 8) 

The first (particle) part has a simple pole at p° = E p — ie. The second (antiparticle) 
part has a simple pole at —E p + ie. This is shown in Fig. B.l(b). 


Fig. B.l (a) Position of the pole in 
the complex E plane for Gq(E). (b) 
Positions of the poles in the complex 
po plane for the Feynman propagator 
for free scalar fields. 


B.3 How to find a residue 

You can find a residue R(zo) at the pole Zo by writing a Laurent series. 
There are more direct methods too. For example, when we have a simple 
pole, we can write 

R(z 0 ) = lim (z - zo)f(z). (B.9) 

2— >Zq 
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Example B.3 

The residue of G(E ) = E _ l E _|_ ie at the simple pole E = E p — it is given by 


R{E P -it) = lim (E - E p + ie)—- = i Z. (B.10) 

—* Esp — ie Jjj — j tLp T 16 


B.4 Three rules of contour integrals 

A contour C is a closed path in the complex plane with a finite number 
of corners which doesn’t cross itself. Integrals around such contours have 
a number of useful properties. 


Example B.4 

We can get some practice with a contour integral by calculating 


dz z 2 , 


(B.ll) 


where the contour is shown in Fig. B.2. We split the contour into two parts and start 
with the straight line along the real axis. Take z = re 10 and this part becomes 


(straight line) = 


f 

J r= — 1 


d rr = 


(B. 12) 


Now for the semicircle, described by z = roe 10 , where ro = 1. We have dz = iroe 10 d# 
giving 


(semicircle) = 


n'K 

/ dQ i 

Jg =o 


irge 3i0 = 


(B.13) 


Adding the contributions we have 


(line) + (semicircle) = 0. 


(B.14) 


There are three useful theorems for evaluating integrals taken around 
contours. The first is Cauchy’s theorem: 


If f(z) is analytic on and inside C then 
y d zf(z) = 0. 


(B.15) 


This is good news, since it says that if the region in a contour contains 
no poles then the integral gives zero. It also explains why the previous 
example gives zero: z 2 is analytic on and inside the contour. 

The second theorem is known as Cauchy’s integral formula: 


Augustin-Louis Cauchy (1789—1857) 


i 11 



Fig. B.2 An example of a contour in 
the complex z plane. 
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Fig. B.3 (a) The contour completed in 
the lower half-plane enclosing the pole. 
Note that the direction here is clock¬ 
wise, earning us an extra minus sign, 
(b) The contour completed in the up¬ 
per half-plane. No poles are enclosed 
so the answer is given by Cauchy’s the¬ 
orem. 


2 

This disappearance of semicircular 
contours in the limit of infinite radius 
is a result of Jordan’s lemma. This 
says that the integral f^° oo dzf(z)e iaz 
along the infinite upper semicircle is 
zero, provided (i) a > 0, and (ii) f(z) 
is a well-behaved function satisfying 
lim fl _ 00 |/(-Re 1 ®)| = 0. 


If f(z) is analytic on and inside a simple closed curve C, and the point 
a is inside C , then the value of /(a) is given by 

/(a) = d z-^-. (B.16) 

2tti J c z — a 

The third is the residue theorem: 

If f(z) has singularities at points Zi, then, for a closed curve enclosing 
these points we have 

£.,/(,) = 2 ^(“V (Zl) ). (B.1T) 

where the integral around C is performed in the anticlockwise direction. 
(You merely change the sign of the answer if you perform the integral 
in the clockwise direction.) 

Often we want to do difficult integrals over real variables. These may 
be turned into easier integrals if we form a contour in the complex plane 
which includes the original domain of integration and use the rules given 
above. The art is in choosing the best contour to do the integral. 


Example B.5 


We can use Cauchy’s theorem along with the residue theorem to justify some of the 
more seemingly cavalier tricks employed in the discussion of propagators in Chap¬ 
ters 16 and 17. Let’s find the inverse Fourier transform of the retarded propagator 
Go (E), given by 

r°° d E 


G+(t-t’) = I 


—oo 27 t E — E p -\- ie 


(B.18) 


for which the integration path is along the real axis. To use our contour integral rules 
we must complete the contour by joining up this path with a further section of path 
which will either be in the upper half of the complex E plane or in the lower half. 

Suppose we take it in the lower half-plane. Then, as we take the limits along the 
real axis to zboo the semicircular path gets larger and larger. This will make a large, 
negative imaginary contribution to E. Let’s call it —i|^|. The exponential will then 
involve a contribution e I’?!* ). If (t — t') is positive then this contribution gets 
smaller, eventually vanishing as the contour becomes infinitely large. 2 We conclude 
that, for the case t — t' > 0, the integral above is equivalent to the contour shown in 
Fig. B.3(a). 

Let’s do that integral. The contour contains the pole at E = E p — ie so we use 
the residue theorem to say 


f d E ie lE ( b * ) _ / Residue at \ 

C 27t E — Ep + ie \ E — E P ~ ie / 


(B.19) 


where the minus sign follows from our attempt to take the integral in the clockwise 
direction. The residue at the pole is ie — iE p (t—t ) e —e(t—t ) an( j the answer is 


G+(t - t') = e- iS p( t - t ') e - e(t - t,) , 


(B.20) 


which we stress applied for (t — t') > 0. 
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What if we tried to complete the contour in the upper half-plane? Then we would 
have obtained a large, positive, imaginary contribution to the exponential resulting 
in a contribution e^ r] ^ t ~ t ) which blows up for t — t' > 0. Such a badly behaved 
integral is certainly not suitable for evaluating the Fourier transform. However, for 
t — t' <0 the semicircular contour has a vanishing contribution at infinity and we 
again have the equivalence of the Fourier transform and the contour C' shown in 
Fig. B.3(b). Notice that C' contains no poles, so Cauchy’s theorem says that the 
integral is zero. 

We conclude that G + (t — t f ) = 0 for t — t'< 0 and noting that e is an infinitesimal 
quantity, we may replace both (i) zero for t — t' <0 and (ii) e e d — * ) for t — t' > 0 
with 9(t — t f ) and conclude 

G + (t - t') = 0(t - t')e -i£; P ( * -t/) , (B.21) 

just as we had in Chapter 16 without the need for adding damping factors by hand! 


The meaning of the ie factors now becomes clear. These infinitesimals 
position the poles of the propagators in such a way as to ensure the 
correct causality relationships. Returning to the scalar field propagator 
of Fig. B.l we see that closing the contour in the lower half-plane picks 
up the positive energy pole, leading to a factor 0(x° — y°), while closing 
the contour in the upper half-plane picks up the negative energy pole 
and leads to the factor 6(y° — x°). This motivates the definition of the 
Feynman propagator in Chapter 17. 


B.5 What is a branch cut? 

The function e 10 is multivalued. We are always at liberty to add whole 
numbers of 2n to the argument in the exponential, i.e. e 10 = e i( 0 + 2 jr™) 
where n is an integer. The fact that this function is multivalued means 
that care must be taken when taking roots and logarithms. 



A 



VWVW/AVvW ► 


Example B.6 

For the case of the logarithm we have, taking z = re 1 ®: 

lnz = lnr + i0. (B.22) 

Clearly, for fixed r, In 2 takes different values for 9 and 9 + 27m, even though both 
choices correspond to the same z. 


We therefore agree that we should only consider angles in some interval 
in 9 of size 27r, known as a branch of the function. In order to make this 
clear in the complex plane, we can define lines that we will agree not 
to cross with any of our operations. These are known as branch cuts. 
The points from which these emerge are known as branch points. 

For In z the branch point is the origin and the branch cut may be taken 
along the positive real axis as shown in Fig. B.4(a). It may also be taken 
along the negative real axis, or indeed any convenient line. Crossing the 
branch cut makes the function jump by 27ri. 



Fig. B.4 (a) The branch cut along 

the positive, real axis, (b) The com¬ 
plex plane for eqn. B.23. 
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Useful complex analysis 


Recall from Chapter 8 the integral 

—^ r d\p\ \ p \e i WWe- it V\p \ 2 +™ 2 , (B.23) 

(2tt) \x\ J—co 

which we consider in the complex |p| plane. The square root in this 
equation \/p 2 + m 2 must be restricted to a single branch. The square 
root vanishes for \p\ = ±im. which are therefore the branch points. 
For convenience, we take the branch cuts to extend along the imaginary 
axis as shown in Fig. B.4(b). Notice that when we do the integral in 
Chapter 8 we can’t cross the cuts with our contour, so we must direct 
the contour around the cuts. 



9 9 

rn 4 m 


Fig. B.5 The pole structure of the full 
propagator in eqn. B.24. 


3 


Specifically we require 



for a < b and 



zboo, 


=F°°, 


for c > b. (That is, one sign in front of 
the oo is plus and one minus.) 


Example B.7 

Another occasion where we must consider a function with a branch cut is the full 
propagator discussed in Chapter 31, given by 


G(p) = 


iZ 


p 2 — m 2 + ic 


L 


30 d M 2 

:4m 2 27T 


p(A/ 2 ) 


p 2 — M 2 + ie ’ 


(B.24) 


which has a pole structure shown in Fig. B.5. The second term in the expression tells 
us to expect multiparticle contributions. These lead to a line of poles with infinites¬ 
imal separation between them. Such a line of poles is another way of describing a 
branch cut and so we draw a cut extending along the real axis from the branch point 
given by the two-particle production threshold (p 0 ) 2 ~ 4m 2 . 


B.6 The principal value of an integral 


The Cauchy principle value is a method of giving improper integrals 
a value. This is not complex analysis, but is often used in integrals 
involving the propagator. Suppose we want to evaluate the integral 
f dxf(x) but we have the problem that the integrand f(x) diverges at 
x = b (where a < b < c) and so both J ^ dx f(x) and / fc c dx f(x) will also 
blow up. 3 In that case we may take the Cauchy principal value of the 
integral, denoted by V and defined by 

V [ dx fix) = lim 
Ja e ^ 0+ 

This gives the integral an unambiguous value, as demonstrated in the 
example below. 


cb—e 


dx f(x) 


dxf(x) 


'6+e 


(B.25) 


Example B.8 

The integral 



dx 


x - 2’ 


(B.26) 
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is not well defined as the integrand diverges at x = 2. You can see that the two 
integrals Jq and J^ 10 both diverge, giving — oo and +oo respectively. 

To get around this we integrate from 0 up to 2 — € and then from 2 + e up to 10, 
thereby cutting out the troublesome part of the problem. We obtain 

h=fo~ e ^ 3 =^- ln 2, h = f£ e = In 10 - In 6. (B.27) 

We find that / = Ii + I 2 = ln5, independent of e. We may therefore take the limit 
e —> 0 and obtain an unambiguous result. We conclude that 

r 1 0 dx 

V -= In 5. (B.28) 

Jo x-2 


The principal value arises in quantum field theory when we want to do 
integrals of the form f a da: where /( x) is a complex-valued function 

and a and b are real, obeying a < 0 < b. In this case we use the following 
theorem which says that 

lim [ &xTTL =V f dx® Ti7r/(0). (B.29) 

e * 0 + J a x ± te J a x 

^Sometimes called the Dirac relation in 
the physics literature. 


Often we take f(x — xq) = 5(x — Xq) and obtain the identity, 4 


1 V 

— ww = — T i7r<5(a;o), 

Xq ± 1 £ Xq 


(B.30) 


as used in Exercises 22.1 and 31.3. 
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